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Preface to the Third Edition 


I have taken advantage of this new edition of my book on differential equa¬ 
tions to add two batches of new material of independent interest: 

First, a fairly substantial appendix at the end of Chapter 1 on the famous 
bell curve. This curve is the graph of the normal distribution func¬ 
tion, with many applications in the natural sciences, the social sciences, 
mathematics—in statistics and probability theory—and engineering. We 
shall be especially interested how the differential equation for this curve 
arises from very simple considerations and can be solved to obtain the equa¬ 
tion of the curve itself. 

And second, a brief section on the van der Pol nonlinear equation and its 
historical background in World War II that gave it significance in the devel¬ 
opment of the theory of radar. This consists, in part, of personal recollections 
of the eminent physicist Freeman Dyson. 

Finally, I should add a few words on the meaning of the cover design, for 
this design amounts to a bit of self-indulgence. 

The chapter on Fourier series is there mainly to provide machinery needed 
for the following chapter on partial differential equations. Flowever, one of 
the minor offshoots of Fourier series is to find the exact sum of the infinite 
series formed from the reciprocals of the squares of the positive integers 
(the first formula on the cover). This sum was discovered by the great Swiss 
mathematician Euler in 1736, and since his time, several other methods for 
obtaining this sum, in addition to his own, have been discovered. This is one 
of the topics dealt with in Sections 34 and 35 and has been one of my own 
minor hobbies in mathematics for many years. 

Flowever, from 1736 to the present day, no one has ever been able to find 
the exact sum of the reciprocals of the cubes of the positive integers (the sec¬ 
ond formula on the cover). Some years ago, I was working with the zeroes 
of the Bessel functions. I thought for an exciting period of several days 
that I was on the trail of this unknown sum, but in the end it did not work 
out. Instead, the trail deviated in an unexpected direction and yielded yet 
another method for finding the sum in the first formula. These ideas will be 
found in Section 47. 
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Preface to the Second Edition 


"As correct as a second edition"—so goes the idiom. I certainly hope so, and 
I also hope that anyone who detects an error will do me the kindness of let¬ 
ting me know, so that repairs can be made. As Confucius said, "A man who 
makes a mistake and doesn't correct it is making two mistakes." 

I now understand why second editions of textbooks are always longer 
than first editions: as with governments and their budgets, there is always 
strong pressure from lobbyists to put things in, but rarely pressure to take 
things out. 

The main changes in this new edition are as follows: the number of prob¬ 
lems in the first part of the book has been more than doubled; there are 
two new chapters, on Fourier Series and on Partial Differential Equations; 
sections on higher order linear equations and operator methods have been 
added to Chapter 3; and further material on convolutions and engineering 
applications has been added to the chapter on Laplace Transforms. 

Altogether, many different one-semester courses can be built on various 
parts of this book by using the schematic outline of the chapters given on 
page xix. There is even enough material here for a two-semester course, if the 
appendices are taken into account. 

Finally, an entirely new chapter on Numerical Methods (Chapter 14) has 
been written especially for this edition by Major John S. Robertson of the 
United States Military Academy. Major Robertson's expertise in these mat¬ 
ters is much greater than my own, and I am sure that many users of this new 
edition will appreciate his contribution, as I do. 

McGraw-Hill and I would like to thank the following reviewers for their 
many helpful comments and suggestions: D. R. Arterburn, New Mexico 
Tech; Edward Beckenstein, St. John's University; Harold Carda, South Dakota 
School of Mines and Technology; Wenxiong Chen, University of Arizona; 
Jerald P. Dauer, University of Tennessee; Lester B. Fuller, Rochester Institute 
of Technology; Juan Gatica, University of Iowa; Richard H. Herman, The 
Pennsylvania State University; Roger H. Marty, Cleveland State University; 
Jean-Pierre Meyer, The Johns Hopkins University; Krzysztof Ostaszewski, 
University of Louisville; James L. Rovnyak, University of Virginia; Alan 
Sharpies, New Mexico Tech; Bernard Shiftman, The Johns Hopkins 
University; and Calvin H. Wilcox, University of Utah. 


George F. Simmons 
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Preface to the First Edition 


To be worthy of serious attention, a new textbook on an old subject should 
embody a definite and reasonable point of view which is not represented by 
books already in print. Such a point of view inevitably reflects the experi¬ 
ence, taste, and biases of the author, and should therefore be clearly stated at 
the beginning so that those who disagree can seek nourishment elsewhere. 
The structure and contents of this book express my personal opinions in a 
variety of ways, as follows. 

The place of differential equations in mathematics. Analysis has been the 
dominant branch of mathematics for 300 years, and differential equations 
are the heart of analysis. This subject is the natural goal of elementary cal¬ 
culus and the most important part of mathematics for understanding the 
physical sciences. Also, in the deeper questions it generates, it is the source 
of most of the ideas and theories which constitute higher analysis. Power 
series, Fourier series, the gamma function and other special functions, inte¬ 
gral equations, existence theorems, the need for rigorous justifications of 
many analytic processes—all these themes arise in our work in their most 
natural context. And at a later stage they provide the principal motivation 
behind complex analysis, the theory of Fourier series and more general 
orthogonal expansions, Lebesgue integration, metric spaces and HiIbert 
spaces, and a host of other beautiful topics in modern mathematics. I would 
argue, for example, that one of the main ideas of complex analysis is the 
liberation of power series from the confining environment of the real num¬ 
ber system; and this motive is most clearly felt by those who have tried to 
use real power series to solve differential equations. In botany, it is obvious 
that no one can fully appreciate the blossoms of flowering plants without 
a reasonable understanding of the roots, stems, and leaves which nourish 
and support them. The same principle is true in mathematics, but is often 
neglected or forgotten. 

Fads are as common in mathematics as in any other human activity, 
and it is always difficult to separate the enduring from the ephemeral in 
the achievements of one's own time. At present there is a strong current 
of abstraction flowing through our graduate schools of mathematics. This 
current has scoured away many of the individual features of the landscape 
and replaced them with the smooth, rounded boulders of general theo¬ 
ries. When taken in moderation, these general theories are both useful and 
satisfying; but one unfortunate effect of their predominance is that if a 
student doesn't learn a little while he is an undergraduate about such color¬ 
ful and worthwhile topics as the wave equation. Gauss's hypergeometric 
function, the gamma function, and the basic problems of the calculus of 
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variations—among many others—then he is unlikely to do so later. The 
natural place for an informal acquaintance with such ideas is a leisurely 
introductory course on differential equations. Some of our current books 
on this subject remind me of a sightseeing bus whose driver is so obsessed 
with speeding along to meet a schedule that his passengers have little or 
no opportunity to enjoy the scenery. Let us be late occasionally, and take 
greater pleasure in the journey. 

Applications. It is a truism that nothing is permanent except change; and 
the primary purpose of differential equations is to serve as a tool for the 
study of change in the physical world. A general book on the subject without 
a reasonable account of its scientific applications would therefore be as futile 
and pointless as a treatise on eggs that did not mention their reproductive 
purpose. This book is constructed so that each chapter except the last has 
at least one major "payoff"—and often several—in the form of a classic sci¬ 
entific problem which the methods of that chapter render accessible. These 
applications include 

The brachistochrone problem 

The Einstein formula £ = me 2 

Newton's law of gravitation 

The wave equation for the vibrating string 

The harmonic oscillator in quantum mechanics 

Potential theory 

The wave equation for the vibrating membrane 

The prey-predator equations 

Nonlinear mechanics 

Hamilton's principle 

Abel's mechanical problem 

I consider the mathematical treatment of these problems to be among the 
chief glories of Western civilization, and I hope the reader will agree. 

The problem of mathematical rigor. On the heights of pure mathematics, 
any argument that purports to be a proof must be capable of withstanding 
the severest criticisms of skeptical experts. This is one of the rules of the 
game, and if you wish to play you must abide by the rules. But this is not the 
only game in town. 

There are some parts of mathematics—perhaps number theory and abstract 
algebra—in which high standards of rigorous proof may be appropriate at 
all levels. But in elementary differential equations a narrow insistence on 
doctrinaire exactitude tends to squeeze the juice out of the subject, so that 
only the dry husk remains. My main purpose in this book is to help the 
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student grasp the nature and significance of differential equations; and to 
this end, I much prefer being occasionally imprecise but understandable to 
being completely accurate but incomprehensible. I am not at all interested 
in building a logically impeccable mathematical structure, in which defini¬ 
tions, theorems, and rigorous proofs are welded together into a formidable 
barrier which the reader is challenged to penetrate. 

In spite of these disclaimers, I do attempt a fairly rigorous discussion from 
time to time, notably in Chapter 13 and Appendices A in Chapters 5,6 and 7, 
and B in Chapter 11. 1 am not saying that the rest of this book is nonrigorous, 
but only that it leans toward the activist school of mathematics, whose pri¬ 
mary aim is to develop methods for solving scientific problems—in contrast 
to the contemplative school, which analyzes and organizes the ideas and 
tools generated by the activists. 

Some will think that a mathematical argument either is a proof or is not a 
proof. In the context of elementary analysis I disagree, and believe instead 
that the proper role of a proof is to carry reasonable conviction to one's 
intended audience. It seems to me that mathematical rigor is like clothing: in 
its style it ought to suit the occasion, and it diminishes comfort and restricts 
freedom of movement if it is either too loose or too tight. 

History and biography. There is an old Armenian saying, "He who lacks 
a sense of the past is condemned to live in the narrow darkness of his own 
generation." Mathematics without history is mathematics stripped of its 
greatness: for, like the other arts—and mathematics is one of the supreme 
arts of civilization—it derives its grandeur from the fact of being a human 
creation. 

In an age increasingly dominated by mass culture and bureaucratic imper¬ 
sonality, I take great pleasure in knowing that the vital ideas of mathemat¬ 
ics were not printed out by a computer or voted through by a committee, 
but instead were created by the solitary labor and individual genius of a 
few remarkable men. The many biographical notes in this book reflect my 
desire to convey something of the achievements and personal qualities of 
these astonishing human beings. Most of the longer notes are placed in the 
appendices, but each is linked directly to a specific contribution discussed 
in the text. These notes have as their subjects all but a few of the greatest 
mathematicians of the past three centuries: Fermat, Newton, the Bernoullis, 
Euler, Lagrange, Laplace, Lourier, Gauss, Abel, Poisson, Dirichlet, Hamilton, 
Liouville, Chebyshev, Hermite, Riemann, Minkowski, and Poincare. As 
T. S. Eliot wrote in one of his essays, "Someone said: The dead writers are 
remote from us because we know so much more than they did.' Precisely, and 
they are that which we know." 

History and biography are very complex, and I am painfully aware 
that scarcely anything in my notes is actually quite as simple as it may 
appear. I must also apologize for the many excessively brief allusions to 
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mathematical ideas most student readers have not yet encountered. But 
with the aid of a good library, sufficiently interested students should be 
able to unravel most of them for themselves. At the very least, such efforts 
may help to impart a feeling for the immense diversity of classical math¬ 
ematics—an aspect of the subject that is almost invisible in the average 
undergraduate curriculum. 


George F. Simmons 


Suggestions for the Instructor 


The following diagram gives the logical dependence of the chapters and sug¬ 
gests a variety of ways this book can be used, depending on the purposes 
of the course, the tastes of the instructor, and the backgrounds and needs of 
the students. 
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Suggestions for the Instructor 


The scientist does not study nature because it is useful; he studies it because 
he delights in it, and he delights in it because it is beautiful. If nature were not 
beautiful, it would not be worth knowing, and if nature were not worth knowing, 
life would not be worth living. Of course I do not here speak of that beauty that 
strikes the senses, the beauty of qualities and appearances; not that I undervalue 
such beauty, far from it, but it has nothing to do with science; I mean that pro¬ 
founder beauty which comes from the harmonious order of the parts, and which 
a pure intelligence can grasp. 

—Henri Poincare 

As a mathematical discipline travels far from its empirical source, or still more, 
if it is a second or third generation only indirectly inspired by ideas coming from 
"reality,"it is beset with very grave dangers. It becomes more and more purely 
aestheticizing, more and more purely l'art pour l'art. This need not be bad, if 
the field is surrounded by correlated subjects, which still have closer empirical 
connections, or if the discipline is under the influence of men with an excep¬ 
tionally well-developed taste. But there is a grave danger that the subject will 
develop along the line of least resistance, that the stream, so far from its source, 
will separate into a multitude of insignificant branches, and that the discipline 
will become a disorganized mass of details and complexities. In other words, at a 
great distance from its empirical source, or after much "abstract" inbreeding, a 
mathematical subject is in danger of degeneration. 

—John von Neumann 

Just as deduction should be supplemented by intuition, so the impulse to progres¬ 
sive generalization must be tempered and balanced by respect and love for color¬ 
ful detail. The individual problem shoidd not be degraded to the rank of special 
illustration of lofty general theories. In fact, general theories emerge from consid¬ 
eration of the specific, and they are meaningless if they do not serve to clarify and 
order the more particularized substance below. The interplay between generality 
and individuality, deduction and construction, logic and imagination—this is 
the profound essence of live mathematics. Any one or another of these aspects of 
mathematics can be at the center of a given achievement. In afar-reaching devel¬ 
opment all of them will be involved. Generally speaking, such a development will 
start from the "concrete" ground, then discard ballast by abstraction and rise to 
the lofty layers of thin air where navigation and observation are easy; after this 
flight comes the crucial test of landing and reaching specific goals in the newly 
surveyed low plains of individual "reality." In brief, the flight into abstract gen¬ 
erality must start from and return to the concrete and specific. 


—Richard Courant 


About the Author 


George Simmons has academic degrees from the California Institute of 
Technology, the University of Chicago, and Yale University. He taught at sev¬ 
eral colleges and universities before joining the faculty of Colorado College 
in 1962, where he is a Professor of Mathematics. He is also the author of 
Introduction to Topology and Modern Analysis (McGraw-Hill, 1963), Precalcidus 
Mathematics in a Nutshell (Janson Publications, 1981), and Calculus with Analytic 
Geometry (McGraw-Hill, 1985). 

When not working or talking or eating or drinking or cooking. Professor 
Simmons is likely to be traveling (Western and Southern Europe, Turkey, 
Israel, Egypt, Russia, China, Southeast Asia), trout fishing (Rocky Mountain 
states), playing pocket billiards, or reading (literature, history, biography and 
autobiography, science, and enough thrillers to achieve enjoyment without 
guilt). 
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Chapter 1 

The Nature of Differential 
Equations. Separable Equations 


1 Introduction 

An equation involving one dependent variable and its derivatives with 
respect to one or more independent variables is called a differential equation. 
Many of the general laws of nature—in physics, chemistry, biology, and 
astronomy—find their most natural expression in the language of differen¬ 
tial equations. Applications also abound in mathematics itself, especially in 
geometry, and in engineering, economics, and many other fields of applied 
science. 

It is easy to understand the reason behind this broad utility of differential 
equations. The reader will recall that if y =f(x) is a given function, then its 
derivative dy/dx can be interpreted as the rate of change of y with respect 
to x. In any natural process, the variables involved and their rates of change 
are connected with one another by means of the basic scientific principles 
that govern the process. When this connection is expressed in mathematical 
symbols, the result is often a differential equation. 

The following example may illuminate these remarks. According to 
Newton's second law of motion, the acceleration a of a body of mass m is 
proportional to the total force F acting on it, with 1/m as the constant of pro¬ 
portionality, so that a = F/m or 


ma = F. (1) 

Suppose, for instance, that a body of mass m falls freely under the influence 
of gravity alone. In this case the only force acting on it is mg, where g is the 
acceleration due to gravity. 1 If y is the distance down to the body from some 
fixed height, then its velocity v = dy/dt is the rate of change of position and its 
acceleration a = dv/dt = d 2 y/dt 2 is the rate of change of velocity. With this nota¬ 
tion, (1) becomes 


1 g can be considered constant on the surface of the earth in most applications, and is approxi¬ 

mately 32 feet per second per second (or 980 centimeters per second per second). 


1 
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Differential Equations with Applications and Historical Notes 


d 2 y 

m —= mg 
dt 2 & 


or 


djL 

dt 2 


= 8 - 


( 2 ) 


If we alter the situation by assuming that air exerts a resisting force propor¬ 
tional to the velocity, then the total force acting on the body is mg - k(dy/dt), 
and (1) becomes 



, dy 

mg - k ir 


(3) 


Equations (2) and (3) are the differential equations that express the essential 
attributes of the physical processes under consideration. 

As further examples of differential equations, we list the following: 


£ 

1 

II 

(4) 


(5) 

dy _ v 2 

— + 2 xy = e; 
dx 

(6) 


(7) 

( 1 -x 2 )^4-2x^ + p(p + % = 0; 
ax ax 

(8) 

x 2 ^ + x^ + (x 2 -p 2 )y = 0. 
dx~ dx 

(9) 


The dependent variable in each of these equations is y, and the independent 
variable is either f or x. The letters k, m, and p represent constants. An ordinary 
differential equation is one in which there is only one independent variable, so 
that all the derivatives occurring in it are ordinary derivatives. Each of these 
equations is ordinary. The order of a differential equation is the order of the 
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highest derivative present. Equations (4) and (6) are first order equations, 
and the others are second order. Equations (8) and (9) are classical, and are 
called Legendre’s equation and Bessel’s equation, respectively. Each has a vast 
literature and a history reaching back hundreds of years. We shall study all 
of these equations in detail later. 

A partial differential equation is one involving more than one independent 
variable, so that the derivatives occurring in it are partial derivatives. For 
example, if vo =f(x,y,z,t) is a function of time and the three rectangular coordi¬ 
nates of a point in space, then the following are partial differential equations 
of the second order: 


d 2 zv d 2 zv d~w 
—v + —v + — r = 0; 
dx 2 dy 2 dz 2 


( '->2 '■si 'si \ 

2 O VO O ZV O W 

■ - H - 7T~\ - 7T 

K dx 2 dy 2 dz 2 , 


dw _ 
dt ' 


f ^2 -\2 ^2 A 

2 O ZV O W O ZV 
' - o —I- n —I- 

y dx~ dy~ dz 2 J 


d 2 w 

~di r ‘ 


These equations are also classical, and are called Laplace's equation, the heat 
equation, and the wave equation, respectively. Each is profoundly significant in 
theoretical physics, and their study has stimulated the development of many 
important mathematical ideas. In general, partial differential equations arise 
in the physics of continuous media—in problems involving electric fields, 
fluid dynamics, diffusion, and wave motion. Their theory is very different 
from that of ordinary differential equations, and is much more difficult in 
almost every respect. For some time to come, we shall confine our attention 
exclusively to ordinary differential equations. 2 


2 The English biologist J. B. S. Haldane (1892-1964) has a good remark about the one-dimen¬ 
sional special case of the heat equation: "In scientific thought we adopt the simplest theory 
which will explain all the facts under consideration and enable us to predict new facts of the 
same kind. The catch in this criterion lies in the word 'simplest.' It is really an aesthetic canon 
such as we find implicit in our criticism of poetry or painting. The layman finds such a law as 

, d 2 zv dw 

a -—=- =- 

dx 2 dt 

much less simple than 'it oozes/ of which it is the mathematical statement. The physicist 
reverses this judgment, and his statement is certainly the more fruitful of the two, so far as 
prediction is concerned. It is, however, a statement about something very unfamiliar to the 
plain man, namely, the rate of change of a rate of change." 







4 


Differential Equations with Applications and Historical Notes 


2 General Remarks on Solutions 

The general ordinary differential equation of the nth order is 



( 1 ) 


or, using the prime notation for derivatives. 


F{x,y,y',y‘,...,yh)) = 0. 


Any adequate theoretical discussion of this equation would have to be 
based on a careful study of explicitly assumed properties of the function F. 
However, undue emphasis on the fine points of theory often tends to obscure 
what is really going on. We will therefore try to avoid being overly fussy 
about such matters—at least for the present. 

It is normally a simple task to verify that a given function y=y(x) is a solu¬ 
tion of an equation like (1). All that is necessary is to compute the derivatives 
of y(x) and to show that y(x) and these derivatives, when substituted in the 
equation, reduce it to an identity in x. In this way we see that 


y = e 2x and y = e 3x 


are both solutions of the second order equation 


( 2 ) 


V" -5y' + 6y = 0; 


and, more generally, that 


y = c t e 2x + c 2 e 3x 


( 3 ) 


is also a solution for every choice of the constants c 1 and c 2 . Solutions of dif¬ 
ferential equations often arise in the form of functions defined implicitly, 
and sometimes it is difficult or impossible to express the dependent variable 
explicitly in terms of the independent variable. For instance. 


*y=iogy+c 


( 4 ) 


is a solution of 


dy _ y 2 

dx 1 - xy 


( 5 ) 
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for every value of the constant c, as we can readily verify by differentiating 
(4) and rearranging the result. 3 These examples also illustrate the fact that 
a solution of a differential equation usually contains one or more arbitrary 
constants, equal in number to the order of the equation. 

In most cases procedures of this kind are easy to apply to a suspected 
solution of a given differential equation. The problem of starting with a dif¬ 
ferential equation and finding a solution is naturally much more difficult. 
In due course we shall develop systematic methods for solving equations 
like (2) and (5). For the present, however, we limit ourselves to a few remarks 
on some of the general aspects of solutions. 

The simplest of all differential equations is 

?=/«, ( 6 ) 

Al¬ 


and we solve it by writing 


y = jf(x)dx + c. (7) 

In some cases the indefinite integral in (7) can be worked out by the methods 
of calculus. In other cases it may be difficult or impossible to find a formula 
for this integral. It is known, for instance, that 

je~ x2 dx and J S ' n X dx 

cannot be expressed in terms of a finite number of elementary functions. 4 If 
we recall, however, that 



is merely a symbol for a function (any function) with derivative/(x), then we 
can almost always give (7) a valid meaning by writing it in the form 

y = jf(t)dt + c. (8) 

*0 


3 In calculus the notation In x is often used for the so-called natural logarithm, that is, the func¬ 
tion log, x. In more advanced courses, however, this function is almost always denoted by the 
symbol log x. 

4 Any reader who is curious about the reasons for this should consult D. G. Mead, "Integration," 

Am. Math. Monthly, vol. 68, pp. 152-156 (1961). For additional details, see G. H. Hardy, The 
Integration of Functions of a Single Variable, Cambridge University Press, London, 1916; or J. F. 
Ritt, Integration in Finite Terms, Columbia University Press, New York, 1948. 
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The crux of the matter is that this definite integral is a function of the upper 
limit x (the t under the integral sign is only a dummy variable) which always 
exists when the integrand is continuous over the range of integration, and 
that its derivative is f(x). 5 

The so-called separable equations, or equations with separable variables, are 
at the same level of simplicity as (6). These are differential equations that can 
be written in the form 


f-/<%<>/), 


where the right side is a product of two functions each of which depends 
on only one of the variables. In such a case we can separate the variables by 
writing 


- d f-= f{x)dx, 

§(y) 


and then solve the original equation by integrating: 



These are simple differential equations to deal with in the sense that the 
problem of solving them can be reduced to the problem of integration, even 
though the indicated integrations can be difficult or impossible to carry out 
explicitly. 

The general first order equation is the special case of (1) which corresponds 
to taking n = 1: 



( 9 ) 


We normally expect that an equation like this will have a solution, and 
that this solution—like (7) and (8)—will contain one arbitrary constant. 
However, 



5 This statement is one form of the fundamental theorem of calculus. 
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has no real-valued solutions at all, and 



2 


has only the single solution y-0 (which contains no arbitrary constants). 
Situations of this kind raise difficult theoretical questions about the exis¬ 
tence and nature of solutions of differential equations. We cannot enter here 
into a full discussion of these questions, but it may clarify matters if we give 
an intuitive description of a few of the basic facts. 

For the sake of simplicity, let us assume that (9) can be solved for dy/dx: 



( 10 ) 


We also assume that/(x,y) is a continuous function throughout some rectan¬ 
gle R in the xy plane. The geometric meaning of a solution of (10) can best be 
understood as follows (Figure 1). If P 0 = (x 0 ,i/ 0 ) is a point in R, then the number 



y 


R 



x 


FIGURE 1 
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determines a direction at P 0 . Now let P, = (x, i/,) be a point near P 0 in this 
direction, and use 



to determine a new direction at P v Next, let P 2 = (x 2 , y 2 ) be a point near P 1 in 
this new direction, and use the number 



to determine yet another direction at P 2 . If we continue this process, we 
obtain a broken line with points scattered along it like beads; and if we now 
imagine that these successive points move closer to one another and become 
more numerous, then the broken line approaches a smooth curve through 
the initial point P 0 . This curve is a solution y = y(x) of equation (10); for at each 
point (x,y) on it, the slope is given by fix,y) —and this is precisely the condition 
required by the differential equation. If we start with a different initial point, 
then in general we obtain a different curve (or solution). Thus the solutions 
of (10) form a family of curves, called integral curves . 6 Furthermore, it appears 
to be a reasonable guess that through each point in R there passes just one 
integral curve of (10). This discussion is intended only to lend plausibility to 
the following precise statement. 

Theorem A. (Picard's theorem.) lff(x,y) and df/dy are continuous functions on a 
closed rectangle R, then through each point (x 0 , y 0 ) in the interior ofR there passes a 
unique integral curve of the equation dy/dx-f(x,y). 

If we consider a fixed value of x 0 in this theorem, then the integral curve 
that passes through (x 0 , y 0 ) is fully determined by the choice of y 0 . In this 
way we see that the integral curves of (10) constitute what is called a one- 
parameter family of curves. The equation of this family can be written in the 
form 


y=y(x,c), 


( 11 ) 


where different choices of the parameter c yield different curves in the fam¬ 
ily. The integral curve that passes through (x 0 , y 0 ) corresponds to the value of 


6 Solutions of a differential equation are sometimes called integrals of the equation because the 
problem of finding them is more or less an extension of the ordinary problem of integration. 
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c for which y 0 -y(x 0 ,c). If we denote this number by c 0 , then (11) is called the 
general solution of (10), and 


y=y(x,c 0 ) 

is called the particular solution that satisfies the initial condition 

y=y 0 when x = x 0 . 

The essential feature of the general solution (11) is that the constant c in it can 
be chosen so that an integral curve passes through any given point of the 
rectangle under consideration. 

Picard's theorem is proved in Chapter 13. This proof is quite complicated, 
and is probably best postponed until the reader has had considerable experi¬ 
ence with the more straightforward parts of the subject. The theorem itself 
can be strengthened in various directions by weakening its hypotheses; it can 
also be generalized to refer to nth order equations solvable for the nth order 
derivative. Detailed descriptions of these results would be out of place in the 
present context, and we content ourselves for the time being with this infor¬ 
mal discussion of the main ideas. In the rest of this chapter we explore some 
of the ways in which differential equations arise in scientific applications. 


Problems 

1. Verify that the following functions (explicit or implicit) are solutions of 


the corresponding differential equations: 

U 

+ 

II 

y' = 2x; 

(b) y=cx 2 

* 

<< 

II 

£ 

+ 

X 

C L 

ll 

2^ 

<< 

<< 

II 

(d) y = ce kx 

£ 

II 

(e) y=c x sin lx + c 2 cos lx 

y"+4y = 0; 

(f) y=c 1 .e 2x +c 2 e~ 2x 

y" - 4y = 0; 

(g) y=c x sinh 2x + c 2 cosh 2x 

y" - 4y = 0; 

(h) y = sin' 1 xy 

*2/' + J/ = J/'V 1- 

(i) y=x tanx 

xy' = y + x 2 + y 2 ; 

(j) x 2 = 2y 2 logy; 

II 

-o 

, X 

S3 

> . 


x +y 

(k) y 2 -x 2 - cx 

2 xyy'=x 2 +y 2 ; 

(1) y = c 2 + c/x 

y+xy' -x\y') 2 -, 
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(m) y-ce y,x 

(n) y + siny = x 

(o) x + y = tan _1 y 


y'=y 2 /(*y - * 2 ); 

(y cos y - sin y + x)y' = y; 
l + y 2 +y 2 y' = 0. 


2. Find the general solution of each of the following differential equations: 

(a) y' = e 3x - x ; 

(b) xy' = l; 

(c) y'-xe x2 -, 

(d) y' = sin 1 x; 

(e) (l+x)y'=x; 

(f) (1 + x 2 )y'=x; 

(g) (l+x 3 )y'=x; 

(h) (1+x 2 )y'=tan _1 x; 

(i) x yy'=y-^ 

(j) x 5 y' + y 5 = 0; 

(k) xy' = (1 - 2x 2 ) tan y; 

(!) y' = 2xy; 

(m) y'siny = x 2 ; 

(n) y'sinx = l; 

(o) y' + y tan x = 0; 

(p) y' ~y tan x =0; 

(q) (1 + x 2 ) dy + (1 + y 2 ) dx = 0; 

(r) y log y dx - x dy = 0. 

3. For each of the following differential equations, find the particular 
solution that satisfies the given initial condition: 

(a) y' -xe x ,y-3 whenx=l; 

(b) y' = 2 sin x cos x, y = 1 when x = 0; 

(c) y' = log x, y = 0 when x = e; 

(d) (x 2 - l)y' = 1, y = 0 when x = 2; 

(e) x(x 2 - 4)y' = 1, y = 0 when x = 1; 

(f) (x + l)(x 2 + l)y' = 2x 2 + x, y = 1 when x = 0. 

4. For each of the following differential equations, find the integral curve 
that passes through the given point: 

(a) y' = e 3 *-2y, (o,0); 

(b) x dy = (2x 2 +1) dx, (1,1); 

(c) e~ y dx + (1 + x 2 ) dy = 0, (0, 0); 

(d) 3 cos 3x cos 2y dx - 2 sin 3x sin 2y dy = 0, (7t/12,n/8); 
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(e) y' = e x cos x, (0,0); 

(f) xyy' = (x+ l)(y+1), (1,0). 

2 f* -t 2 

5. Show that y = e x I e 1 dt is a solution of y' = 2 xy +1. 

J o 

6. For the differential equation (2), namely, 

y" ~ 5y' + 6y = 0, 

carry out the detailed calculations needed to verify the assertions in 
the text that 

(a) y = e 2x and y = e 3x are both solutions; and 

(b) y = c } e 2x + c 2 e 3x is a solution for every choice of the constants c, and c 2 . 
Remark: In studying a book like this, a student should never slide 
past assertions of this kind—involving such phrases as "we see" 
or "as we can readily verify"—without personally checking their 
validity The mere fact that something is in print does not mean it 
is necessarily true. Cultivate skepticism as a healthy state of mind, 
as you would physical fitness; accept nothing on the authority 
of this writer or any other until you have understood it fully for 
yourself. 

7. In the spirit of Problem 6, verify that (4) is a solution of the differential 
equation (5) for every value of the constant c. 

8. For what values of the constant m will y-e mx be a solution of the dif¬ 
ferential equation 

2 y m + y" - 5y' + 2y = 0 ? 


Use the ideas in Problem 6 to find a solution containing three arbitrary 
constants c lr c 2 , c 3 . 


3 Families of Curves. Orthogonal Trajectories 

We have seen that the general solution of a first order differential equa¬ 
tion normally contains one arbitrary constant, called a parameter. When this 
parameter is assigned various values, we obtain a one-parameter family of 
curves. Each of these curves is a particular solution, or integral curve, of the 
given differential equation, and all of them together constitute its general 
solution. 

Conversely, as we might expect, the curves of any one-parameter family 
are integral curves of some first order differential equation. If the family is 


f(x,y,c) = 0 , 


( 1 ) 
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then its differential equation can be found by the following steps. First, dif¬ 
ferentiate (1) implicitly with respect to x to get a relation of the form 

g [ x ' y -to' c y°- p) 

Next, eliminate the parameter c from (1) and (2) to obtain 

F i X,y 'dx) = ° (3) 

as the desired differential equation. For example, 

x 2 +y 2 -c 2 (4) 

is the equation of the family of all circles with centers at the origin (Figure 2). 
On differentiation with respect to x this becomes 

2x + 2y — = 0; 

J dx 



FIGURE 2 
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and since c is already absent, there is no need to eliminate it and 

x + y C y- = 0 (5) 

ax 

is the differential equation of the given family of circles. Similarly, 

x 2 +y 2 -2cx (6) 

is the equation of the family of all circles tangent to the y-axis at the origin 
(Figure 3). When we differentiate this with respect to x, we obtain 

2x + 2 v— = 2c 
J dx 

or 


dy 

x + 1 / — = c 
J dx 


( 7 ) 



FIGURE 3 
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The parameter c is still present, so it is necessary to eliminate it by combining 
(6) and (7). This yields 


dy _ if-x 2 
dx 1x\j 


( 8 ) 


as the differential equation of the family (6). 

As an interesting application of these procedures, we consider the prob¬ 
lem of finding orthogonal trajectories. To explain what this problem is, we 
observe that the family of circles represented by (4) and the family y = mx of 
straight lines through the origin (the dotted lines in Figure 2) have the fol¬ 
lowing property: each curve in either family is orthogonal (i.e., perpendicu¬ 
lar) to every curve in the other family. Whenever two families of curves 
are related in this way, each is said to be a family of orthogonal trajectories of 
the other. Orthogonal trajectories are of interest in the geometry of plane 
curves, and also in certain parts of applied mathematics. For instance, if 
an electric current is flowing in a plane sheet of conducting material, then 
the lines of equal potential are the orthogonal trajectories of the lines of 
current flow. 

In the example of the circles centered on the origin, it is geometrically 
obvious that the orthogonal trajectories are the straight lines through the 
origin, and conversely. In order to cope with more complicated situations, 
however, we need an analytic method for finding orthogonal trajectories. 
Suppose that 


| L = f(x,y) (9) 

dx 

is the differential equation of the family of solid curves in Figure 4. These 
curves are characterized by the fact that at any point (x,ij) on any one of 
them the slope is given by f(x,y). The dotted orthogonal trajectory through 
the same point, being orthogonal to the first curve, has as its slope the nega¬ 
tive reciprocal of the first slope. Thus, along any orthogonal trajectory, we 
have dy/dx ~ -\/f(x,y) or 


dx 

dy 


f{x,y). 


( 10 ) 


Our method of finding the orthogonal trajectories of a given family of curves 
is therefore as follows: first, find the differential equation of the family; next, 
replace dy/dx by -dx/dy to obtain the differential equation of the orthogonal 
trajectories; and finally, solve this new differential equation. 
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FIGURE 4 


If we apply this method to the family of circles (4) with differential equa¬ 
tion (5), we get 


r 

x + y ■ 

v 


dx' 
dl J, 


= 0 


or 


fy = y 

dx x 


( 11 ) 


as the differential equation of the orthogonal trajectories. We can now sepa¬ 
rate the variables in (11) to obtain 


dy _ dx 

y x ' 


which on direct integration yields 

log y = log x + log c 


or 


y = C x 


as the equation of the orthogonal trajectories. 
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FIGURE 5 


It is often convenient to express the given family of curves in terms of polar 
coordinates. In this case we use the fact that if \p is the angle from the polar 
radius to the tangent, then tan \p = r dQ/dr (Figure 5). By the above discussion, 
we replace this expression in the differential equation of the given family 
by its negative reciprocal, -dr/r dQ, to obtain the differential equation of the 
orthogonal trajectories. As an illustration of the value of this technique, we 
find the orthogonal trajectories of the family of circles (6). If we use rect¬ 
angular coordinates, it follows from (8) that the differential equation of the 
orthogonal trajectories is 


dy 2 xy 

dx x 2 - y 2 


( 12 ) 


Unfortunately, the variables in (12) cannot be separated, so without addi¬ 
tional techniques for solving differential equations we can go no further in 
this direction. However, if we use polar coordinates, the equation of the fam¬ 
ily (6) can be written as 


r=2c cos 0. 


(13) 
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From this we find that 


— = -2csin0, (14) 

dQ 

and after eliminating c from (13) and (14) we arrive at 

rdQ _ cos0 
dr sin 0 

as the differential equation of the given family. Accordingly, 

rdQ _ sin0 
dr cos 0 


is the differential equation of the orthogonal trajectories. In this case the 
variables can be separated, yielding 

dr cos 0 d0 
r sin0 


and after integration this becomes 

log r = log (sin 0) + log 2c, 


so that 


r = 2c sin 0 (15) 

is the equation of the orthogonal trajectories. It will be noted that (15) is the 
equation of the family of all circles tangent to the x-axis at the origin (see the 
dotted curves in Figure 3). 

In Chapter 2 we develop a number of more elaborate procedures for 
solving first order equations. Since our present attention is directed more 
at applications than formal techniques, all the problems given in this 
chapter are solvable by the method of separation of variables illustrated 
above. 


Problems 

1. Sketch each of the following families of curves, find the orthogonal tra¬ 
jectories, and add them to the sketch: 

(a) xy = c; 

(b) y=cx 2 ; 
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(c) r-c (1 +cos 0); 

(d) y = ce x . 

2. What are the orthogonal trajectories of the family of curves (a) y = cx 4 ; 
(b) y~cx n where n is any positive integer? In each case, sketch both 
families of curves. What is the effect on the orthogonal trajectories of 
increasing the exponent n? 

3. Show that the method for finding orthogonal trajectories in polar coor¬ 
dinates can be expressed as follows. If dr/dQ = F(r, 0) is the differential 
equation of the given family of curves, then dr/dQ--r 2 /F(r, 0) is the dif¬ 
ferential equation of the orthogonal trajectories. Apply this method to 
the family of circles r = 2c sin 0. 

4. Use polar coordinates to find the orthogonal trajectories of the family 
of parabolas r-c/( 1 - cos 0), c > 0. Sketch both families of curves. 

5. Sketch the family y 2 = 4c(x + c) of all parabolas with axis the x-axis and 
focus at the origin, and find the differential equation of the family. 
Show that this differential equation is unaltered when dy/dx is replaced 
by -dx/dy. What conclusion can be drawn from this fact? 

6. Find the curves that satisfy each of the following geometric conditions: 

(a) The part of the tangent cut off by the axes is bisected by the point of 
tangency. 

(b) The projection on the x-axis of the part of the normal between (x,y) 
and the x-axis has length 1. 

(c) The projection on the x-axis of the part of the tangent between (x,y) 
and the x-axis has length 1. 

(d) The part of the tangent between (x, y) and the x-axis is bisected by 
the y-axis. 

(e) The part of the normal between (x, y) and the y-axis is bisected by 
the x-axis. 

(f) (x, y) is equidistant from the origin and the point of intersection of 
the normal with the x-axis. 

(g) The polar angle 0 equals the angle \p from the polar radius to the 
tangent. 

(h) The angle \|/ from the polar radius to the tangent is constant. 

7. A curve rises from the origin in the xy-plane into the first quadrant. 
The area under the curve from (0,0) to (x, y) is one-third the area of the 
rectangle with these points as opposite vertices. Find the equation of 
the curve. 

8. Three vertices of a rectangle of area A lie on the x-axis, at the origin, 
and on the y-axis. If the fourth vertex moves along a curve y = y(x) in the 
first quadrant in such a way that the rate of change of A with respect to 
x is proportional to A, find the equation of the curve. 
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9. A saddle without a saddle-horn (pommel) has the shape of the sur¬ 
face z = y 2 - x 2 . It is lying outdoors in a rainstorm. Find the paths along 
which raindrops will run down the saddle. Draw a sketch and use it to 
convince yourself that your answer is reasonable. 

10. Find the differential equation of each of the following one-parameter 
families of curves: 

(a) y = x sin (x + c); 

(b) all circles through (1, 0) and (-1, 0); 

(c) all circles with centers on the line y = x and tangent to both axes; 

(d) all lines tangent to the parabola x 2 -4y (hint: the slope of the tangent 
line at (2a, a 2 ) is a); 

(e) all lines tangent to the unit circle x 2 + y 2 = 1. 

11. In part (d) of Problem 10, show that the parabola itself is an integral 
curve of the differential equation of the family of all its tangent lines, 
and that therefore through each point of this parabola there pass tivo 
integral curves of this differential equation. Do the same for the unit 
circle in part (e) of Problem 10. 


4 Growth, Decay, Chemical Reactions, and Mixing 

We remind the student that the number e is often defined by the limit 



or slightly more generally (put h = 1 / n), by the limit 


e = lim(l + h) Vh . 


( 1 ) 


In words, this says that e is the limit of 1 plus a small number, raised to 
the power of the reciprocal of the small number, as that small number 
approaches 0. 

We recall from calculus that the importance of the number e lies 
mainly in the fact that the exponential function y = e x is unchanged by 
differentiation: 


d 


— e 
dx 
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An equivalent statement is that y = e x is a solution of the differential 
equation 


dy_ 


More generally, if k is any given nonzero constant, then all of the functions 
y = ce kx are solutions of the differential equation 



( 2 ) 


This is easy to verify by differentiation, and can also be discovered by sepa¬ 
rating the variables and integrating: 



Further, it is not difficult to show that these functions are the only solu¬ 
tions of equation (2) [see Problem 1]. In this section we discuss a surpris¬ 
ingly wide variety of applications of these facts to a number of different 
sciences. 

Example 1. Continuously compounded interest. If P dollars is depos¬ 
ited in a bank that pays an interest rate of 6 percent per year, compounded 
semiannually, then after t years the accumulated amount is 


A = P (1 + 0.03) 2f . 


More generally, if the interest rate is 100/c percent (fc = 0.06 for 6 percent), 
and if this interest is compounded n times a year, then after t years the 
accumulated amount is 



If n is now increased indefinitely, so that the interest is compounded 
more and more frequently, then we approach the limiting case of con¬ 
tinuously compounded interest. 7 To find the formula for A under these 
circumstances, we observe that (1) yields 


Many banks pay interest daily, which corresponds to n = 365. This number is large enough to 
make continuously compounded interest a very accurate model for what actually happens. 
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so 


A = Pe kt . (3) 

We describe this situation by saying that the amount A grows exponen¬ 
tially, or provides an example of exponential growth. To understand the 
meaning of the constant k from a different point of view, we differentiate 
(3) to obtain 


— = Pke u =kA. 
dt 

If we write this differential equation for A in the form 

dA/A , 

—-— = k, 
dt 

then we see that k can be thought of as the fractional change in A per unit 
time, and 100ft: is the percentage change in A per unit time. 


Example 2. Population growth. Suppose that x 0 bacteria are placed in 
a nutrient solution at time t = 0, and that x = x(t) is the population of the 
colony at a later time t. If food and living space are unlimited, and if 
as a consequence the population at any moment is increasing at a rate 
proportional to the population at that moment, find rasa function of f. 8 

Since the rate of increase of x is proportional to x itself, we can write 
down the differential equation 


By separating the variables and integrating, we get 
dr 

— = kdt , logx = kt + c. 

x 

Since x=x 0 when t = 0, we have c =logx 0 , so log x = kt + log x 0 and 

X = XnC^ 1 . 

We therefore have another example of exponential growth. 


(4) 


Briefly, this assumption about the rate means that we expect twice as many "births" in a 
given short interval of time when twice as many bacteria are present. 
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To make these ideas more concrete, let us assume for the sake of dis¬ 
cussion that the total human population of the earth grows in this way. 
According to the United Nations demographic experts, this population 
is increasing at an overall rate of approximately 2 percent per year, so 
fc = 0.02 = l/50 and (4) becomes 


x=x 0 e t/50 . (5) 

To find the "doubling time" T, that is, the time needed for the total num¬ 
ber of people in the world to increase by a factor of 2, we replace (5) by 

2x 0 =x 0 e T/5 °. 


This yields T/50 = log 2, so 

T=50 log 2 = 34.65 years, 


since log 2 = 0.693. 9 


Example 3. Radioactive decay. If molecules of a certain kind have a ten¬ 
dency to decompose into smaller molecules at a rate unaffected by the 
presence of other substances, then it is natural to expect that the num¬ 
ber of molecules of this kind that will decompose in a unit of time will 
be proportional to the total number present. A chemical reaction of this 
type is called a first order reaction. 

Suppose, for instance, that x 0 grams of matter are present initially, and 
decompose in a first order reaction. If x is the number of grams present 
at a later time t, then the principle stated above yields the following dif¬ 
ferential equation: 


- = kx, k> 0. 

dt 


( 6 ) 


[Since dx/dt is the rate of growth of x, -dx/dt is its rate of decay, and (6) 
says that the rate of decay is proportional to x.\ If we separate the vari¬ 
ables in (6) and integrate, we obtain 


— = -kdt, logx = -fcf + c. 
x 


9 It is worth mentioning that the population of the industrialized nations is increasing at a 
rate somewhat less than 2 percent, while that of the third world nations is increasing at a rate 
greater than 2 percent. From the point of view of the development of the human race and 
its social and political institutions over the next several centuries, this is perhaps the most 
important single fact about our contemporary world. 
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The initial condition 


x=x 0 when f = 0 (7) 

gives c = log Xg, so log x=-kt + log x 0 and 

x=x 0 e~ kt . (8) 

This function is therefore the solution of the differential equation (6) 
that satisfies the initial condition (7). Its graph is given in Figure 6. 
The positive constant k is called the rate constant, for its value is clearly 
a measure of the rate at which the reaction proceeds. As we know 
from Example 1, k can be thought of as the fractional loss of x per unit 
time. 

Very few first order chemical reactions are known, and by far the 
most important of these is radioactive decay. It is convenient to express 
the rate of decay of a radioactive element in terms of its half-life, which 
is the time required for a given quantity of the element to diminish by 
a factor of one-half. If we replace x by x 0 /2 in formula (8), then we get 
the equation 


2 


= x 0 e 


-KT 


for the half-life T, so 


kT =log 2. 

If either k or T is known from observation or experiment, this equation 
enables us to find the other. 

The situation discussed here is an example of exponential decay. This 
phrase refers only to the form of the function (8) and the manner in 
which the quantity x diminishes, and not necessarily to the idea that 
something or other is disintegrating. 



FIGURE 6 
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Example 4. Mixing. A tank contains 50 gallons of brine in which 75 
pounds of salt are dissolved. Beginning at time t = 0, brine containing 3 
pounds of salt per gallon flows in at the rate of 2 gallons per minute, and 
the mixture (which is kept uniform by stirring) flows out at the same 
rate. When will there be 125 pounds of dissolved salt in the tank? How 
much dissolved salt is in the tank after a long time? 

If x=x(f) is the number of pounds of dissolved salt in the tank at time 
t > 0, then the concentration at that time is x/50 pounds per gallon. The 
rate of change of x is 

dx 

— = rate at which salt enters tank - rate at which salt leaves tank. 

dt 

Since 


rate of entering = 3-2 = 6 lb/min 


and 


rate of leaving = (x/50) ■ 2 


—lb/min, 
25 7 


we have 


dx _ x _ 150 - x 
dt ~ ~25~ 25 


Separating variables and integrating give 

——— =—dt and log(150-x) = - — t + c. 

150-x 25 & 25 

Since x = 75 when t = 0, we see that c=log 75, so 


log(150 - x) = t + log 75, 


and therefore 


150 - x = 75e~ t/25 or x = 75(2 - r' t25 ). 

This tells us that x = 125 implies e f/25 = 3 or f/25 = log 3. We conclude that 
x = 125 pounds after 


t = 25 log 3 = 27.47 minutes, 

since log 3 = 1.0986. Also, when t is large we see that x is nearly 75 • 2 = 150 
pounds, as common sense tells us without calculation. 
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The ideas discussed in Example 3 are the basis for a scientific tool of fairly 
recent development which has been of great significance for geology and 
archaeology In essence, radioactive elements occurring in nature (with known 
half-lives) can be used to assign dates to events that took place from a few thou¬ 
sand to a few billion years ago. For example, the common isotope of uranium 
decays through several stages into helium and an isotope of lead, with a half- 
life of 4.5 billion years. When rock containing uranium is in a molten state, as 
in lava flowing from the mouth of a volcano, the lead created by this decay 
process is dispersed by currents in the lava; but after the rock solidifies, the 
lead is locked in place and steadily accumulates alongside the parent uranium. 
A piece of granite can be analyzed to determine the ratio of lead to uranium, 
and this ratio permits an estimate of the time that has elapsed since the critical 
moment when the granite crystallized. Several methods of age determination 
involving the decay of thorium and the isotopes of uranium into the various 
isotopes of lead are in current use. Another method depends on the decay 
of potassium into argon, with a half-life of 1.3 billion years; and yet another, 
preferred for dating the oldest rocks, is based on the decay of rubidium into 
strontium, with a half-life of 50 billion years. These studies are complex and 
susceptible to errors of many kinds; but they can often be checked against one 
another, and are capable of yielding reliable dates for many events in geologi¬ 
cal history linked to the formation of igneous rocks. Rocks tens of millions of 
years old are quite young, ages ranging into hundreds of millions of years are 
common, and the oldest rocks yet discovered are upwards of 3 billion years 
old. This of course is a lower limit for the age of the earth's crust, and so for 
the age of the earth itself. Other investigations, using various types of astro¬ 
nomical data, age determinations for minerals in meteorites, and so on, have 
suggested a probable age for the earth of about 4.5 billion years. 10 

The radioactive elements mentioned above decay so slowly that the meth¬ 
ods of age determination based on them are not suitable for dating events 
that took place relatively recently. This gap was filled by Willard Libby's dis¬ 
covery in the late 1940s of radiocarbon, a radioactive isotope of carbon with a 
half-life of about 5600 years. By 1950 Libby and his associates had developed 
the technique of radiocarbon dating, which added a second hand to the slow- 
moving geological clocks described above and made it possible to date events 
in the later stages of the Ice Age and some of the movements and activities of 
prehistoric man. The contributions of this technique to late Pleistocene geol¬ 
ogy and archaeology have been spectacular. 

In brief outline, the facts and principles involved are these. Radiocarbon is 
produced in the upper atmosphere by the action of cosmic ray neutrons on 
nitrogen. This radiocarbon is oxidized to carbon dioxide, which in turn is 
mixed by the winds with the nonradioactive carbon dioxide already present. 
Since radiocarbon is constantly being formed and constantly decomposing 


10 For a full discussion of these matters, as well as many other methods and results of the sci¬ 
ence of geochronology, see F. E. Zeuner, Dating the Past, 4th ed., Methuen, London, 1958. 
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back into nitrogen, its proportion to ordinary carbon in the atmosphere has 
long since reached an equilibrium state. All air-breathing plants incorporate 
this proportion of radiocarbon into their tissues, as do the animals that eat 
these plants. This proportion remains constant as long as a plant or animal 
lives; but when it dies it ceases to absorb new radiocarbon, while the supply 
it has at the time of death continues the steady process of decay. Thus, if a 
piece of old wood has half the radioactivity of a living tree, it lived about 
5600 years ago, and if it has only a fourth this radioactivity, it lived about 
11,200 years ago. This principle provides a method for dating any ancient 
object of organic origin, for instance, wood, charcoal, vegetable fiber, flesh, 
skin, bone, or horn. The reliability of the method has been verified by apply¬ 
ing it to the heartwood of giant sequoia trees whose growth rings record 
3000 to 4000 years of life, and to furniture from Egyptian tombs whose age is 
also known independently. There are technical difficulties, but the method 
is now felt to be capable of reasonable accuracy as long as the periods of time 
involved are not too great (up to about 50,000 years). 

Radiocarbon dating has been applied to thousands of samples, and labo¬ 
ratories for carrying on this work number in the dozens. Among the more 
interesting age estimates are these: linen wrappings from the Dead Sea scrolls 
of the Book of Isaiah, recently found in a cave in Palestine and thought to 
be first or second century b.c., 1917 ± 200 years; charcoal from the Lascaux 
cave in southern France, site of the remarkable prehistoric paintings, 15,516 ± 
900 years; charcoal from the prehistoric monument at Stonehenge, in southern 
England, 3798 ± 275 years; charcoal from a tree burned at the time of the volca¬ 
nic explosion that formed Crater Lake in Oregon, 6453 ± 250 years. Campsites 
of ancient man throughout the Western Hemisphere have been dated by using 
pieces of charcoal, fiber sandals, fragments of burned bison bone, and the like. 
The results suggest that human beings did not arrive in the New World until 
about the period of the last Ice Age, roughly 25,000 years ago, when the level of 
the water in the oceans was substantially lower than it now is and they could 
have walked across the Bering Straits from Siberia to Alaska. 11 


Problems 


1. If k is a given nonzero constant, show that the functions y = ce kx are the 
only solutions of the differential equation 


dy 

dx 


= ky. 


11 Libby won the 1960 Nobel Prize for chemistry as a consequence of the work described above. 
His own account of the method, with its pitfalls and conclusions, can be found in his book 
Radiocarbon Dating, 2d ed.. University of Chicago Press, 1955. 
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Hint: Assume that f(x) is a solution of this equation and show that 
f(x)/e kx is a constant. 

2. Suppose that P dollars is deposited in a bank that pays interest at an 
annual rate of r percent compounded continuously. 

(a) Find the time T required for this investment to double in value as a 
function of the interest rate r. 

(b) Find the interest rate that must be obtained if the investment is to 
double in value in 10 years. 

3. A bright young executive with foresight but no initial capital makes 
constant investments of D dollars per year at an annual interest rate of 
100k percent. Assume that the investments are made continuously and 
that interest is compounded continuously. 

(a) Find the accumulated amount A at any time t. 

(b) If the interest rate is 6 percent, what must D be if 1 million dollars is 
to be available for retirement 40 years later? 

(c) If the bright young executive is bright enough to find a safe invest¬ 
ment opportunity paying 10 percent, what must D be to achieve 
the same result of 1 million dollars 40 years later? (It is worth notic¬ 
ing that if this amount of money is simply squirreled away with¬ 
out interest each year for 40 years, the grand total will be less than 
$80,000.) 

4. A newly retired person invests total life savings of P dollars at an interest 
rate of 1007c percent per year, compounded continuously. Withdrawals 
for living expenses are made continuously at a rate of W dollars per 
year. 

(a) Find the accumulated amount A at any time t. 

(b) Find the withdrawal rate W 0 at which A will remain constant. 

(c) If W is greater than the value W 0 found in part (b), then A will 
decrease and ultimately disappear. How long will this take? 

(d) Find the time in part (c) if the interest rate is 5 percent and W = 2W 0 . 

5. A certain stock market tycoon has a fortune that increases at a rate 
proportional to the square of its size at any time. If he had 10 million 
dollars a year ago, and has 20 million dollars today, how wealthy will 
he be in 6 months? In a year? 

6. A bacterial culture of population x is known to have a growth rate pro¬ 
portional to x itself. Between 6 p.m. and 7 f.m. the population triples. At 
what time will the population become 100 times what it was at 6 p.m.? 

7. The population of a certain mining town is known to increase at a 
rate proportional to itself. After 2 years the population doubled, and 
after 1 more year the population was 10,000. What was the original 
population? 
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8. It is estimated by experts on agriculture that one-third of an acre of 
land is needed to provide food for one person on a continuing basis. It 
is also estimated that there are 10 billion acres of arable land on earth, 
and that therefore a maximum population of 30 billion people can be 
sustained if no other sources of food are known. The total world popu¬ 
lation at the beginning of 1970 was 3.6 billion. Assuming that the popu¬ 
lation continues to increase at the rate of 2 percent per year, when will 
the earth be full? What will be the population in the year 2000? 

9. A mold grows at a rate proportional to the amount present. At the 
beginning the amount was 2 grams. In 2 days the amount has increased 
to 3 grams. 

(a) If x = x(f) is the amount of the mold at time t, show that x = 2(3/2) t/2 . 

(b) Find the amount at the end of 10 days. 

10. In Example 2, assume that living space for the colony of bacteria is 
limited and food is supplied at a constant rate, so that competition 
for food and space acts in such a way that ultimately the population 
will stabilize at a constant level x, (x, can be thought of as the larg¬ 
est population sustainable by this environment). Assume further that 
under these conditions the population grows at a rate proportional to 
the product of x and the difference x 1 - x, and find x as a function of t. 
Sketch the graph of this function. When is the population increasing 
most rapidly? 

11. Nuclear fission produces neutrons in an atomic pile at a rate propor¬ 
tional to the number of neutrons present at any moment. If n 0 neutrons 
are present initially, and n x and n 2 neutrons are present at times f , and 
f 2 , show that 


'th' 

*2 

" thf 





12. If half of a given quantity of radium decomposes in 1600 years, what 
percentage of the original amount will be left at the end of 2400 years? 
At the end of 8000 years? 

13. If the half-life of a radioactive substance is 20 days, how long will it take 
for 99 percent of the substance to decay? 

14. A field of wheat teeming with grasshoppers is dusted with an insecti¬ 
cide having a kill rate of 200 per 100 per hour. What percentage of the 
grasshoppers are still alive 1 hour later? 

15. Uranium-238 decays at a rate proportional to the amount present. If x 1 
and x 2 grams are present at times f, and t 2 , show that the half-life is 

(f 2 -fi)log2 

log(x!/x 2 ) 







The Nature of Differential Equations 


29 


16. Suppose that two chemical substances in solution react together to 
form a compound. If the reaction occurs by means of the collision and 
interaction of the molecules of the substances, then we expect the rate 
of formation of the compound to be proportional to the number of colli¬ 
sions per unit time, which in turn is jointly proportional to the amounts 
of the substances that are untransformed. A chemical reaction that pro¬ 
ceeds in this manner is called a second order reaction, and this law of 
reaction is often referred to as the law of mass action. Consider a second 
order reaction in which x grams of the compound contain ax grams of 
the first substance and bx grams of the second, where a + b = 1. If there 
are aA grams of the first substance present initially, and bB grams of the 
second, and if x - 0 when t = 0, find x as a function of the time f. 12 

17. Many chemicals dissolve in water at a rate which is jointly proportional 
to the amount undissolved and to the difference between the concen¬ 
tration of a saturated solution and the concentration of the actual solu¬ 
tion. For a chemical of this kind placed in a tank containing G gallons 
of water, find the amount x undissolved at time t if x = x 0 when t = 0 and 
x = when t = t v and if S is the amount dissolved in the tank when the 
solution is saturated. 

18. Suppose that a given population can be divided into two groups: those 
who have a certain infectious disease, and those who do not have it but 
can catch it by having contact with an infected person. If x and y are the 
proportions of infected and uninfected people, then x + y = l. Assume 
that (1) the disease spreads by the contacts just mentioned between sick 
people and well people, (2) that the rate of spread dx/dt is proportional 
to the number of such contacts, and (3) that the two groups mingle 
freely with each other, so that the number of contacts is jointly propor¬ 
tional to x and y. If x = x 0 when t = 0, find x as a function of f, sketch the 
graph, and use this function to show that ultimately the disease will 
spread through the entire population. 

19. A tank contains 100 gallons of brine in which 40 pounds of salt are dis¬ 
solved. It is desired to reduce the concentration of salt to 0.1 pounds per 
gallon by pouring in pure water at the rate of 5 gallons per minute and 
allowing the mixture (which is kept uniform by stirring) to flow out at 
the same rate. How long will this take? 

20. An aquarium contains 10 gallons of polluted water. A filter is attached 
to this aquarium which drains off the polluted water at the rate of 5 gal¬ 
lons per hour and replaces it at the same rate by pure water. How long 
does it take to reduce the pollution to half its initial level? 


12 Students who are especially interested in first and second order chemical reactions will 
find a much more detailed discussion by Linus Pauling, probably the greatest chemist of 
the twentieth century, in his book General Chemistry, 3d ed., W. H. Freeman and Co., San 
Francisco, 1970. See particularly the chapter "The Rate of Chemical Reactions," which is 
Chapter 16 in the 3d edition. 
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21. A party is being held in a room that contains 1800 cubic feet of air which 
is originally free of carbon monoxide. Beginning at time t = 0 several 
people start smoking cigarettes. Smoke containing 6 percent carbon 
monoxide is introduced into the room at the rate of 0.15 cubic feet/min, 
and the well-circulated mixture leaves at the same rate through a small 
open window. Extended exposure to a carbon monoxide concentration 
as low as 0.00018 can be dangerous. When should a prudent person 
leave this party? 

22. According to Lambert's law of absorption, the percentage of incident light 
absorbed by a thin layer of translucent material is proportional to the 
thickness of the layer. 13 If sunlight falling vertically on ocean water is 
reduced to one-half its initial intensity at a depth of 10 feet, at what 
depth is it reduced to one-sixteenth its initial intensity? Solve this prob¬ 
lem by merely thinking about it, and also by setting up and solving a 
suitable differential equation. 

23. If sunlight falling vertically on lake water is reduced to three-fifths its 
initial intensity I 0 at a depth of 15 feet, find its intensity at depths of 30 
feet and 60 feet. Find the intensity at a depth of 50 feet. 

24. Consider a column of air of cross-sectional area 1 square inch extend¬ 
ing from sea level up to "infinity." The atmospheric pressure p at an 
altitude h above sea level is the weight of the air in this column above 
the altitude h. Assuming that the density of the air is proportional to 
the pressure, show that p satisfies the differential equation 



and obtain the formula p = p 0 e~ ch , where p 0 is the atmospheric pressure 
at sea level. 

25. Assume that the rate at which a hot body cools is proportional to the 
difference in temperature between it and its surroundings {Newton’s 
law of cooling 14 ). A body is heated to 110°C and placed in air at 10°C. 
After 1 hour its temperature is 60°C. How much additional time is 
required for it to cool to 30°C? 

26. A body of unknown temperature is placed in a freezer which is kept 
at a constant temperature of 0°F. After 15 minutes the temperature of 


13 Johann Heinrich Lambert (1728-1777) was a Swiss-German astronomer, mathematician, 
physicist, and man of learning. He was mainly self-educated, and published works on the 
orbits of comets, the theory of light, and the construction of maps. The Lambert equal-area 
projection is well known to all cartographers. He is remembered among mathematicians for 
having given the first proof that n is irrational. 

14 Newton himself applied this rule to estimate the temperature of a red-hot iron ball. So little 
was known about the laws of heat transfer at that time that his result was only a rough 
approximation, but it was certainly better than nothing. 
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the body is 30°F and after 30 minutes it is 15°F. What was the initial 
temperature of the body? Solve this problem by merely thinking about 
it, and also by solving a suitable differential equation. 

27. A pot of carrot-and-garlic soup cooling in air at 0°C was initially boil¬ 
ing at 100°C and cooled 20° during the first 30 minutes. How much will 
it cool during the next 30 minutes? 

28. For obvious reasons, the dissecting-room of a certain coroner is kept 
very cool at a constant temperature of 5°C (- 41°F). While doing an 
autopsy early one morning on a murder victim, the coroner himself is 
killed and the victim's body is stolen. At 10 a.m. the coroner's assistant 
discovers his chief's body and finds its temperature to be 23°C, and at 
noon the body's temperature is down to 18.5°C. Assuming the coroner 
had a normal temperature of 37°C (- 98.6°F) when he was alive, when 
was he murdered? 15 

29. The radiocarbon in living wood decays at the rate of 15.30 disintegra¬ 
tions per minute (dpm) per gram of contained carbon. Using 5600 years 
as the half-life of radiocarbon, estimate the age of each of the following 
specimens discovered by archaeologists and tested for radioactivity in 
1950: 

(a) a piece of a chair leg from the tomb of King Tutankhamen, 10.14 
dpm; 

(b) a piece of a beam of a house built in Babylon during the reign of 
King Hammurabi, 9.52 dpm; 

(c) dung of a giant sloth found 6 feet 4 inches under the surface of the 
ground inside Gypsum Cave in Nevada, 4.17 dpm; 

(d) a hardwood atlatl (spear-thrower) found in Leonard Rock Shelter in 
Nevada, 6.42 dpm. 


5 Falling Bodies and Other Motion Problems 

In this section we study the dynamical problem of determining the motion 
of a particle along a given path under the action of given forces. We con¬ 
sider only two simple cases: a vertical path, in which the particle is fall¬ 
ing either freely under the influence of gravity alone, or with air resistance 
taken into account; and a circular path, typified by the motion of the bob of 
a pendulum. 


15 The idea for this problem is due to James F. Hurley, "An Application of Newton's Law of 
Cooling," The Mathematics Teacher, vol. 67 (1974), pp. 141-2. 
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Free fall. The problem of a freely falling body was discussed in Section 1, 
and we arrived at the differential equation 


d 2 y 

dt 2 


= S 


( 1 ) 


for this motion, where y is the distance down to the body from some fixed 
height. One integration yields the velocity, 

v= d J = gt +Ci- (2) 

dt 


Since the constant c x is clearly the value of v when t = 0, it is the initial velocity 
v 0 , and (2) becomes 


dy 

v.-.gt + v„. 


(3) 


On integrating again we get 

1 a2 , 

y = +v ° t+Ci - 

The constant c 2 is the value of y when t = 0, or the initial position y 0 , so we 
finally have 

1 , 

y = 2«* + +y 0 (4) 

as the general solution of (1). If the body falls from rest starting at y = 0, so that 
v 0 -y 0 = Q, then (3) and (4) reduce to 


v = gt and 



On eliminating t we have the useful equation 

v = V2gy (5) 

for the velocity attained in terms of the distance fallen. This result can also 
be obtained from the principle of conservation of energy, which can be stated 
in the form 
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kinetic energy + potential energy = a constant. 

Since our body falls from rest starting at y = 0, the fact that its gain in kinetic 
energy equals its loss in potential energy gives 


1 

2 


mv 2 


= mgy, 


and (5) follows at once. 


Retarded fall. If we assume that air exerts a resisting force proportional 
to the velocity of our falling body, then the differential equation of the 
motion is 


d 2 u dy 

where c = k/m [see Equation l-(3)]. If dy/dt is replaced by v, this becomes 

dv 


( 6 ) 


dt 


= g~cv. 


(7) 


On separating variables and integrating, we get 

dv 


= dt 


and 


1 


g-cv 


log(g-cv) = t + c u 


so 


g-cv= c 2 e~ ct . 


The initial condition v - 0 when t- 0 gives c 2 -g, so 


v = ^(l-e- ct ). 
c 


( 8 ) 
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Since c is positive, v -»■ g/c as t This limiting value of v is called the ter¬ 
minal velocity. If we wish, we can now replace v by dy/dt in (8) and perform 
another integration to find y as a function of t. 


The motion of a pendulum. Consider a pendulum consisting of a bob of 
mass m at the end of a rod of negligible mass and length a. If the bob is pulled 
to one side through an angle a and released (Figure 7), then by the principle 
of conservation of energy we have 



= mg(a cos9 - a cosa). 


Since s = a0 and v-ds/dt-a{dQ/dt), this equation gives 


1 

2 



ga (cos 0-cos a); 


(9) 


( 10 ) 


and on solving for dt and taking into account the fact that 0 decreases as t 
increases (for small f), we get 



dQ 



FIGURE 7 
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If T is the period, that is, the time required for one complete oscillation, then 

T I a |* dQ 
4 ]j 2g J Vcos0-cosa 


or 


T = 



dd 

%/cos 0-cos a ' 


( 11 ) 


The value of T in this formula clearly depends on a, which is the reason why 
pendulum clocks vary in their rate of keeping time as the bob swings through 
a larger or smaller angle. 16 Formula (11) for the period can be expressed more 
satisfactorily as follows. Since by one of the half-angle formulas of trigonom¬ 
etry we have 


cos0 = l-2sin 2 


0 

2 


and 


we can write 


cos a = l-2sin 2 


a 

2 ' 


T= 2 Pf d ° 

V 8 { -ysin 2 (a/2)-sin 2 (0/2) 


2 pf de 

~ sin 2 (0/2) 


k 


. a 
sm —. 
2 


( 12 ) 


We now change the variable from 0 to 4> by putting sin (0/2) = k sin (}), so that 
4) increases from 0 to n/2 as 0 increases from 0 to a, and 

1 0 

— cos —d0 = kcosddd) 

2 2 


or 

^ 2/c cos (j) cfcj) 2^/k 2 -sin 2 (0/2) d<\> 

cos(0/2) -y/l — fc 2 sin 2 4> 


16 This dependence of the period on the amplitude of the swing is what is meant by the "circu¬ 
lar error" of pendulum clocks. 
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This enables us to write (12) in the form 


where 


r—lt/2 

T = 4 1 f d 4> 

\g J ^l~k 2 sin 2 4> 




(13) 


is a function of k and eft called the elliptic integral of the first kind. 17 The elliptic 
integral of the second kind, 


<t> 

E(k,<\>) = J*^/l-fc 2 sin 2 ()) d(|), 

o 

arises in connection with the problem of finding the circumference of an 
ellipse (see Problem 9). These elliptic integrals cannot be evaluated in terms 
of elementary functions. Since they occur quite frequently in applications to 
physics and engineering, their values as numerical functions of k and <|> are 
often given in mathematical tables. 

Our discussion of the pendulum problem up to this point has focused on 
the first order equation (10). For some purposes it is more convenient to deal 
with the second order equation obtained by differentiating (10) with respect 
to t: 


= -gsine. (14) 

dt 

If we now recall that sin 0 is approximately equal to 0 for small values of 0, 
then (14) becomes (approximately) 


d 2 Q 

dt 2 


+ 


— 9 = 0 
a 


(15) 


17 It is customary in the case of elliptic integrals to violate ordinary usage by allowing the same 
letter to appear as the upper limit and as the dummy variable of integration. 
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It will be seen later (in Section 11) that the general solution of the important 
second order equation 


d 2 u 

dx 2 


+ k * 1 2 y = 0 


is 


y-c x sin kx + c 2 cos kx, 


so (15) yields 


9 = Ci sin 


g 

—t + c 2 cos 



(16) 


The requirement that 0 = a and dQ/dt -0 when t-0 implies that c, =0 and 
c 2 = a, so (16) reduces to 


9 = acos 



(17) 


The period of this approximate solution of (14) is In^Ja/g. It is interesting to 
note that this is precisely the value of T obtained from (13) when k = 0, which 
is approximately true when the pendulum oscillates through very small 
angles. 


Problems 

1. If the air resistance acting on a falling body of mass m exerts a retard¬ 
ing force proportional to the square of the velocity, then equation (7) 
becomes 


where c-k/m. If v-0 when f = 0, find v as a function of f. What is the 
terminal velocity in this case? 

2. A torpedo is traveling at a speed of 60 miles/hour at the moment it 
runs out of fuel. If the water resists its motion with a force proportional 
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to the speed, and if 1 mile of travel reduces its speed to 30 miles/hour, 
how far will it coast? 18 

3. A rock is thrown upward from the surface of the earth with initial 
velocity 128 feet/second. Neglecting air resistance and assuming that 
the only force acting on the rock is a constant gravitational force, find 
the maximum height it reaches. When does it reach this height, and 
when does it hit the ground? Answer these questions if the initial 
velocity is v 0 . 

4. A mass m is thrown upward from the surface of the earth with initial 
velocity v 0 . If air resistance is assumed to be proportional to velocity, 
with constant of proportionality k, and if the only other force acting 
on the mass is a constant gravitational force, show that the maximum 
height attained is 


mv o 

nr 


m 2 g 

k 2 


log 


1 + 


kv o ' 

mg y 


Use l'Hospital's rule to show that this quantity —> if jig, in accordance 
with the result of Problem 3. 

5. The force that gravity exerts on a body of mass m at the surface of the 
earth is mg. In space, however, Newton's law of gravitation asserts that 
this force varies inversely as the square of the distance to the earth's 
center. If a projectile fired upward from the surface is to keep traveling 
indefinitely, and if air resistance is neglected, show that its initial veloc¬ 
ity must be at least f 2gR , where R is the radius of the earth (about 4000 
miles). This escape velocity is approximately 7 miles/second or 25,000 
miles/hour. Hint: If x is the distance from the center of the earth to the 
projectile, and v = dx/dt is its velocity, then 

d 2 x _ dv _ dv dx _ dv 
dt 2 dt dx dt dx 


6. In Problem 5, if v e denotes the escape velocity and v 0 < v e , so that the 
projectile rises high but does not escape, show that 


(Vp/Ve) 
~(v 0 /v e ) 2 


is the height it attains before it falls back to earth. 


18 In the treatment of dynamical problems by means of vectors, the words velocity and speed 
are sharply distinguished from one another. However, in the relatively simple situations we 
consider, it is permissible (and customary) to use them more or less interchangeably, as we 
do in everyday speech. 









The Nature of Differential Equations 


39 


7. Apply the ideas in Problem 5 to find the velocity attained by a body 
falling freely from rest at an initial altitude 3 R above the surface of the 
earth down to the surface. What will be the velocity at the surface if the 
body falls from an infinite height? 

8. Inside the earth, the force of gravity is proportional to the distance 
from the center. If a hole is drilled through the earth from pole to pole, 
and a rock is dropped into the hole, with what velocity will it reach the 
center? 

9. (a) Show that the length of the part of the ellipse x 2 /a 2 + y 2 /b 2 = 1 (a >b) 

that lies in the first quadrant is 



where e is the eccentricity, 

(b) Use the change of variable x-a sin <|> to transform the integral in (a) 
into 


jt/2 

«J" sjl - e 2 sin 2 <)) d 4> = aE(e, n / 2), 

o 

so that the complete circumference of the ellipse is 4 aE(e, n/2). 

10. Show that the length of one arch of y = sin x is 2^2£(^12, n/2). 

11. Show that the total length of the lemniscate r 2 = a 2 cos 20 is AaF(J2 , ic/4). 

12. Given the cylinder and sphere whose equations in cylindrical coordi¬ 
nates are r-a sin 0 and r 2 + z 2 =b 2 , with a <b, show that: 

(a) The area of the part of the cylinder that lies inside the sphere is 
4abE(a/b,n/ 2). 

(b) The area of the part of the sphere that lies inside the cylinder is 
2b 2 [n - 2E(a/b,n/2)\. 

13. Establish the following evaluations of definite integrals in terms of 
elliptic integrals: 

(a) , = J2F ( Jt/2.,n/2) [hint: put x = n/2-y, then cos y = cos 2 ([)]; 

Jo Vsinx v ' ' 

(b) j* yjcosx dx = 2j2.E^Jl/2,n/2^-j2F^Jl/2,n/2j [hint: put cos x = 
cos 2 ([>]; 

( C ) j; /2 v^ 4 sin 2 xdx = J5E ^4/5, n/2 j [hint: put x = n/2 - 4>]. 
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6 The Brachistochrone. Fermat and the Bernoullis 

Imagine that a point A is joined by a straight wire to a lower point B in 
the same vertical plane (Figure 8), and that a bead is allowed to slide with¬ 
out friction down the wire from A to B. We can also consider the case in 
which the wire is bent into an arc of a circle, so that the motion of the bead 
is the same as that of the descending bob of a pendulum. Which descent 
takes the least time, that along the straight path, or that along the circular 
path? Since the straight wire joining A and B is clearly the shortest path, 
we might guess that this wire also yields the shortest time. However, a 
moment's consideration of the possibilities will make us more skeptical 
about this conjecture. There might be an advantage in having the bead 
slide down more steeply at first, thereby increasing its speed more quickly 
at the beginning of the motion; for with a faster start, it is reasonable to 
suppose that the bead might reach B in a shorter time, even though it trav¬ 
els over a longer path. For these reasons, Galileo believed that the bead 
would descend more quickly along the circular path, and probably most 
people would agree with him. 

Many years later, in 1696, John Bernoulli posed a more general problem. 
He imagined that the wire is bent into the shape of an arbitrary curve, and 
asked which curve among the infinitely many possibilities will give the 
shortest possible time of descent. This curve is called the brachistochrone 
(from the Greek brachistos, shortest + chronos, time). Our purpose in this 



FIGURE 8 
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(a) 




FIGURE 9 


section is to understand Bernoulli's marvelous solution of this beautiful 
problem. 

We begin by considering an apparently unrelated problem in optics. 
Figure 9a illustrates a situation in which a ray of light travels from A to P 
with velocity v t and then, entering a denser medium, travels from P to B with 
a smaller velocity v 2 . In terms of the notation in the figure, the total time T 
required for the journey is given by 

T _ iv | \]b 2 + (c-x) 2 

Vi v 2 


If we assume that this ray of light is able to select its path from A to B by way 
of P in such a way as to minimize T, then dT/dx = 0 and by the methods of 
elementary calculus we find that 

x _ c-x 
Vi^ja 2 + x 2 v 2 ^b 2 +(c-x) 2 
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or 


sinai _ sina 2 
Vi v 2 


This is Snell's lazv of refraction, which was originally discovered experimen¬ 
tally in the less illuminating form sin cq/sin a 2 = a constant. 19 The assump¬ 
tion that light travels from one point to another along the path requiring the 
shortest time is called Fermat's principle of least time. This principle not only 
provides a rational basis for Snell's law, but can also be applied to find the 
path of a ray of light through a medium of variable density, where in general 
light will travel along curves instead of straight lines. In Figure 9b we have 
a stratified optical medium. In the individual layers the velocity of light is 
constant, but the velocity decreases from each layer to the one below it. As 
the descending ray of light passes from layer to layer, it is refracted more and 
more toward the vertical, and when Snell's law is applied to the boundaries 
between the layers, we obtain 

sinai _ sina 2 _ sina 3 _ sina 4 

Vi V 2 V 3 Vi 


If we next allow these layers to grow thinner and more numerous, then in 
the limit the velocity of light decreases continuously as the ray descends, and 
we conclude that 


-= a constant. 

v 

This situation is indicated in Figure 9c, and is approximately what happens 
to a ray of sunlight falling on the earth as it slows in descending through 
atmosphere of increasing density. 

Returning now to Bernoulli's problem, we introduce a coordinate system 
as in Figure 10 and imagine that the bead (like the ray of light) is capable of 
selecting the path down which it will slide from A to B in the shortest pos¬ 
sible time. The argument given above yields 

sin a , , ... 

-= a constant. (1) 

v 


19 Willebrord Snell (1591-1626) was a Dutch astronomer and mathematician. At the age of 
twenty-two he succeeded his father as professor of mathematics at Leiden. His fame rests 
mainly on his discovery in 1621 of the law of refraction, which played a significant role in the 
development of both calculus and the wave theory of light. 
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FIGURE 10 


By the principle of conservation of energy, the velocity attained by the bead 
at a given level is determined solely by its loss of potential energy in reach¬ 
ing that level, and not at all by the path that brought it there. As in the pre¬ 
ceding section, this gives 


v = gy- 

From the geometry of the situation we also have 


sin a = cos P = 


sec 


P V l + tan 2 p y /l + (ijf 


( 2 ) 


(3) 


On combining equations (1), (2), and (3)—obtained from optics, mechanics, 
and calculus—we get 


y[i + (y') 2 ]=c (4) 

as the differential equation of the brachistochrone. 

We now complete our discussion, and discover what curve the brachisto¬ 
chrone actually is, by solving (4). When y' is replaced by dij/dx and the vari¬ 
ables are separated, (4) becomes 


dx = 


h/2 


dy. 


v-y) 


(5) 
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At this point we introduce a new variable (|) by putting 

= tan^. 


f V/2 

y 

\ c ~y j 


so that y-c sin 2 4>, dy = 2c sin <|i cos c|) dcf), and 

dx = tan (j) dy 
- 2c sin 2 <{> d<\> 

= c(l - cos 24 >) c?4>. 

Integration now yields 


x = — (2(j) - sin 24») + C\. 


( 6 ) 


Our curve is to pass through the origin, so by (6) we have x = y = 0 when 4> = 0, 
and consequently c l = 0. Thus 


x = ^ (2(j) - sin 24») (7) 

and 

y = csin 2 (|) = ^(l-cos2(|)). (8) 

If we now put a = c/2 and 0 = 2<|>, then (7) and (8) become 

x=a(0-sin9) and y = fl(l-cos9). (9) 

These are the standard parameteric equations of the cycloid shown in Figure 
11, which is generated by a point on the circumference of a circle of radius a 
rolling along the x-axis. We note that there is a single value of a that makes 
the first arch of this cycloid pass through the point B in Figure 10; for if a is 
allowed to increase from 0 to °°, then the arch inflates, sweeps over the first 
quadrant of the plane, and clearly passes through B for a single suitably cho¬ 
sen value of a. 

Some of the geometric properties of the cycloid are perhaps familiar to 
the reader from elementary calculus. For example, the length of one arch is 
4 times the diameter of the generating circle, and the area under one arch is 
3 times the area of this circle. This remarkable curve has many other interest¬ 
ing properties, both geometric and physical, and some of these are described 
in the problems below. 
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FIGURE 11 


We hope that the necessary details have not obscured the wonderful imag¬ 
inative qualities in Bernoulli's brachistochrone problem and his solution of it, 
for this whole structure of thought is a work of intellectual art of a very high 
order. In addition to its intrinsic interest, the brachistochrone problem has a 
larger significance: it was the historical source of the calculus of variations —a 
powerful branch of analysis that in modern times has penetrated deeply into 
the hidden simplicities at the heart of the physical world. We shall discuss 
this subject in Chapter 12, and develop a general method for obtaining equa¬ 
tion (4) that is applicable to a wide variety of similar problems. 


Note on Fermat. Pierre de Fermat (1601-1665) was perhaps the greatest math¬ 
ematician of the seventeenth century, but his influence was limited by his 
lack of interest in publishing his discoveries, which are known mainly from 
letters to friends and marginal notes in the books he read. By profession he 
was a jurist and the king's parliamentary counselor in the French provin¬ 
cial town of Toulouse. However, his hobby and private passion was math¬ 
ematics. In 1629 he invented analytic geometry, but most of the credit went to 
Descartes, who hurried into print with his own similar ideas in 1637. At this 
time—13 years before Newton was born—Fermat also discovered a method 
for drawing tangents to curves and finding maxima and minima, which 
amounted to the elements of differential calculus. Newton acknowledged, 
in a letter that became known only in 1934, that some of his own early ideas 
on this subject came directly from Fermat. In a series of letters written in 
1654, Fermat and Pascal jointly developed the fundamental concepts of the 
theory of probability. His discovery in 1657 of the principle of least time, and 
its connection with the refraction of light, was the first step ever taken in the 
direction of a coherent theory of optics. It was in the theory of numbers, how¬ 
ever, that Fermat's genius shone most brilliantly, for it is doubtful whether his 
insight into the properties of the familiar but mysterious positive integers has 
ever been equaled. We mention a few of his many discoveries in this field. 
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1. Fermat's two squares theorem: Every prime number of the form 4 n +1 
can be written as the sum of two squares in one and only one way 

2. Fermat's theorem: If p is any prime number and n is any positive inte¬ 
ger, then p divides nP-n. 

3. Fermat's last theorem: If n > 2, then x n + y n = z n cannot be satisfied by 
any positive integers x, y, z. 

He wrote this last statement in the margin of one of his books, in connec¬ 
tion with a passage dealing with the fact that x 2 + y 2 =z 2 has many integer 
solutions. He then added the tantalizing remark, "I have found a truly won¬ 
derful proof which this margin is too narrow to contain." Unfortunately no 
proof has ever been discovered by anyone else, and Fermat's last theorem 
remains to this day one of the most baffling unsolved problems of math¬ 
ematics. Finding a proof would confer instant immortality on the finder, but 
the ambitious student should be warned that many able mathematicians 
(and some great ones) have tried in vain for hundreds of years. 

(This is the way things were, until Andrew Wiles of Princeton University 
proved Fermat's Last Theorem in 1994-95; see Annals of Mathematics 
141(3):443-551. This proof required 108 pages, and it's been said that no more 
than about a dozen people in the world are able to understand it. The way is 
wide open for someone to rediscover the one- or one-and-a-half-page proof 
that Fermat discovered but didn't bother to write down.) 

Note on the Bernoulli Family. Most people are aware that Johann Sebastian 
Bach was one of the greatest composers of all time. However, it is less well 
known that his prolific family was so consistently talented in this direction 
that several dozen Bachs were eminent musicians from the sixteenth to the 
nineteenth centuries. In fact, there were parts of Germany where the very word 
bach meant a musician. What the Bach clan was to music, the Bernoullis were 
to mathematics and science. In three generations this remarkable Swiss fam¬ 
ily produced eight mathematicians—three of them outstanding—who in turn 
had a swarm of descendants who distinguished themselves in many fields. 

James Bernoulli (1654-1705) studied theology at the insistence of his father, 
but abandoned it as soon as possible in favor of his love for science. He taught 
himself the new calculus of Newton and Leibniz, and was professor of math¬ 
ematics at Basel from 1687 until his death. He wrote on infinite series, stud¬ 
ied many special curves, invented polar coordinates, and introduced the 
Bernoulli numbers that appear in the power series expansion of the function 
tan x. In his book Ars Conjectandi he formulated the basic principle in the 
theory of probability known as Bernoulli's theorem or the law of large numbers: 
if the probability of a certain event is p, and if n independent trials are made 
with k successes, then k/n -* p as n -*■ °°. At first sight this statement may 
seem to be a trivality, but beneath its surface lies a tangled thicket of philo¬ 
sophical (and mathematical) problems that have been a source of controversy 
from Bernoulli's time to the present day. 
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James's younger brother John Bernoulli (1667-1748) also made a false start 
in his career, by studying medicine and taking a doctor's degree at Basel in 
1694 with a thesis on muscle contraction. However, he also became fasci¬ 
nated by calculus, quickly mastered it, and applied it to many problems in 
geometry, differential equations, and mechanics. In 1695 he was appointed 
professor of mathematics and physics at Groningen in Holland, and on 
James's death he succeeded his brother in the professorship at Basel. The 
Bernoulli brothers sometimes worked on the same problems, which was 
unfortunate in view of their jealous and touchy dispositions. On occasion 
the friction between them flared up into a bitter and abusive public feud, as 
it did over the brachistochrone problem. In 1696 John proposed the problem 
as a challenge to the mathematicians of Europe. It aroused great interest, 
and was solved by Newton and Leibniz as well as by the two Bernoullis. 
John's solution (which we have seen) was the more elegant, while James's— 
though rather clumsy and laborious—was more general. This situation 
started an acrimonious quarrel that dragged on for several years and was 
often conducted in rough language more suited to a street brawl than a 
scientific discussion. John appears to have been the more cantankerous of 
the two; for much later, in a fit of jealous rage, he threw his own son out of 
the house for winning a prize from the French Academy that he coveted 
for himself. 

This son, Daniel Bernoulli (1700-1782), studied medicine like his father 
and took a degree with a thesis on the action of the lungs; and like his father 
he soon gave way to his inborn talent and became a professor of mathematics 
at St. Petersburg. In 1733 he returned to Basel and was successively professor 
of botany, anatomy, and physics. He won 10 prizes from the French Academy, 
including the one that infuriated his father, and over the years published 
many works on physics, probability, calculus, and differential equations. In 
his famous book Hydrodynamica he discussed fluid mechanics and gave the 
earliest treatment of the kinetic theory of gases. He is considered by many to 
have been the first genuine mathematical physicist. 


Problems 

1. It is stated in the text that the length of one arch of the cycloid (9) is 4 
times the diameter of the generating circle (Wren's theorem 20 ). Prove 
this. 


20 Christopher Wren (1632-1723), the greatest of English architects, was an astronomer and 
mathematician—in fact, Savilian Professor of Astronomy at Oxford—before the Great Fire 
of London in 1666 gave him his opportunity to build St. Paul's Cathedral, as well as dozens 
of smaller churches throughout the city. 
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2. It is stated in the text that the area under one arch of the cycloid (9) is 
3 times the area of the generating circle (Torricelli's theorem 21 ). Prove 
this. 

3. Obtain equations (9) for the cycloid by direct integration from the inte¬ 
grated form of equation (5), 



by starting with the algebraic substitution u 2 =y/(c - y) and continuing 
with a natural trigonometric substitution. 

4. Consider a wire bent into the shape of the cycloid (9), and invert it as in 
Figure 10. If a bead is release d at the origin and slides down the wire 
without friction, show that n^ja/g is the time it takes to reach the point 
( na,2a ) at the bottom. 

5. Show that the number ji ja/g in Problem 4 is also the time the bead 
takes to slide to the bottom from any intermediate point, so that the 
bead will reach the bottom in the same time no matter where it is 
released. This is known as the tautochrone property of the cycloid, from 
the Greek tauto, the same + chronos, time. 22 

6. At sunset a man is standing at the base of a dome-shaped hill where 
it faces the setting sun. He throws a rock straight up in such a manner 
that the highest point it reaches is level with the top of the hill. As the 
rock rises, its shadow moves up the surface of the hill at a constant 
speed. Show that the profile of the hill is a cycloid. 


21 Evangelista Torricelli (1608-1647) was an Italian physicist and mathematician and a disciple 
of Galileo, whom he served as secretary. In addition to discovering and proving the theorem 
stated above, he advanced the first correct ideas—which were narrowly missed by Galileo— 
about atmospheric pressure and the nature of vacuums, and invented the barometer as an 
application of his theories. See James B. Conant, Science and Common Sense, Yale University 
Press, New Haven, 1951, pp. 63-71. The geometric theorems of Wren and Torricelli stated in 
Problems 1 and 2 are straightforward calculus exercises for us. It is interesting to consider 
how they might have been discovered and proved at a time when the powerful methods of 
calculus did not exist. 

22 The tautochrone property of the cyloid was discovered by the great Dutch scientist 
Christiaan Huygens (1629-1695). He published it in 1673 in his treatise on the theory of pen¬ 
dulum clocks, and it was well-known to all European mathematicians at the end of the sev¬ 
enteenth century. When John Bernoulli published his discovery of the brachistochrone in 
1696, he expressed himself in the following exuberant language (in Latin, of course): "With 
justice we admire Huygens because he first discovered that a heavy particle falls down 
along a common cycloid in the same time no matter from what point on the cycloid it begins 
its motion. But you will be petrified with astonishment when I say that precisely this cycloid, 
the tautochrone of Huygens, is our required brachistochrone." 
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Miscellaneous Problems for Chapter 1 


1 . 


2 . 


It began to snow on a certain morning, and the snow continued to 
fall steadily throughout the day. At noon a snowplow started to clear 
a road at a constant rate in terms of the volume of snow removed per 
hour. The snowplow cleared 2 miles by 2 p.m. and 1 more mile by 
4 p.m. When did it start snowing? 


A mothball whose radius was originally — inch is found to have 

a radius of — inch after 1 month. Assuming that it evaporates at a 
8 

rate proportional to its surface, find the radius as a function of time. 
After how many more months will it disappear altogether? 


3. A tank contains 100 gallons of pure water. Beginning at time t = 0, brine 
containing 1 pound salt/gallon flows in at the rate of 1 gallon/minute, 
and the mixture (which is kept uniform by stirring) flows out at the 
same rate. When will there be 50 pounds of dissolved salt in the tank? 


4. A large tank contains 100 gallons of brine in which 200 pounds of 
salt are dissolved. Beginning at time t = 0, pure water flows in at the 
rate of 3 gallons/minute, and the mixture (which is kept uniform by 
stirring) flows out at the rate of 2 gallons/minute. How long will it 
take to reduce the amount of salt in the tank to 100 pounds? 

5. A smooth football having the shape of an ellipsoid 12 inches long 
and 6 inches thick is lying outdoors in a rainstorm. Find the paths 
along which water will run down its sides. 


6. If c is a positive constant and a is a positive parameter, then 


+ 



= 1 


is the equation of the family of all ellipses (a > c) and hyperbolas 
(a < c) with foci at the points (±c, 0). Show that this family of confocal 
conics is self-orthogonal (see Problem 3-2). 

7. According to Torricelli's law, water in an open tank will flow out 
through a small hole in the bottom with the speed it would acquire 
in falling freely from the water level to the hole. A hemispherical 
bowl of radius R is initially full of water, and a small circular hole 
of radius r is punched in the bottom at time t = 0. How long will the 
bowl take to empty itself? 

8. The clepsydra, or ancient water clock, was a bowl from which water 
was allowed to escape through a small hole in the bottom. It was 
often used in Greek and Roman courts to time the speeches of law¬ 
yers, in order to keep them from talking too much. Find the shape it 
should have if the water level is to fall at a constant rate. 
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9. Two open tanks with identical small holes in the bottom drain in the 
same time. One is a cylinder with a vertical axis and the other is a 
cone with vertex down. If they have equal bases and the height of the 
cylinder is h, what is the height of the cone? 

10. A cylindrical can partly filled with water is rotated about its axis 
with constant angular velocity co. Show that the surface of the water 
assumes the shape of a paraboloid of revolution. (Hint: The centrip¬ 
etal force acting on a particle of water of mass m at the free surface is 
mxM 2 where x is its distance from the axis, and this is the resultant of 
the downward gravitational force mg and the normal reaction force 
R due to other nearby particles of water.) 

11. Consider a bead at the highest point of a circle in a vertical plane, 
and let that point be joined to any lower point on the circle by a 
straight wire. If the bead slides down the wire without friction, show 
that it will reach the circle in the same time regardless of the position 
of the lower point. 

12. A chain 4 feet long starts with 1 foot hanging over the edge of a table. 
Neglect friction, and find the time required for the chain to slide off 
the table. 

13. Experience tells us that a man holding one end of a rope wound 
around a wooden post can restrain with a small force a much greater 
force at the other end. Quantitatively, is is not difficult to see that 
if T and T + AT are the tensions in the rope at angles 0 and 0 + A0 
in Figure 12, then a normal force of approximately T A0 is exerted 
by the rope on the post in the region between 0 and 0 + A0. It fol¬ 
lows from this that if g is the coefficient of friction between the rope 
and the post, then AT is approximately gT A0. Use this statement to 



FIGURE 12 
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formulate the differential equation relating T and 0, and solve this 
equation to find T as a function of 0, g, and the force T 0 exerted by the 
man. 

14. A load L is supported by a tapered circular column whose material 
has density a. If the radius of the top of the column is r 0 , find the 
radius rata distance x below the top if the areas of the horizontal 
cross sections are proportional to the total loads they bear. 

15. The President and the Prime Minister order coffee and receive cups 
of equal temperature at the same time. The President adds a small 
amount of cool cream immediately, but does not drink his coffee 
until 10 minutes later. The Prime Minister waits 10 minutes, and 
then adds the same amount of cool cream and begins to drink. Who 
drinks the hotter coffee? 

16. A destroyer is hunting a submarine in a dense fog. The fog lifts for 
a moment, discloses the submarine on the surface 3 miles away, and 
immediately descends. The speed of the destroyer is twice that of 
the submarine, and it is known that the latter will at once dive and 
depart at full speed in a straight course of unknown direction. What 
path should the destroyer follow to be certain of passing directly 
over the submarine? Hint: Establish a polar coordinate system with 
the origin at the point where the submarine was sighted. 

17. Four bugs sit at the corners of a square table of side a. At the same 
instant they all begin to walk with the same speed, each moving 
steadily toward the bug on its right. If a polar coordinate system is 
established on the table, with the origin at the center and the polar 
axis along a diagonal, find the path of the bug that starts on the 
polar axis and the total distance it walks before all bugs meet at the 
center. 


Appendix A: Some Ideas From the Theory of 
Probability: The Normal Distribution Curve (or 
Bell Curve) and Its Differential Equation 

Suppose a measurement or experiment is performed many times, and that 
its result is a number. We can think, for example, of weighing the babies born 
in a certain hospital during a given year, or of measuring the annual rainfall 
in a certain city over a number of years. Suppose the possible results of our 
measurement or experiment are numbers x that lie in an interval a < x < b. To 
record our results we can divide the interval [a, b] into n subintervals of equal 
length, say a-x 0 < x x < x 2 < < x n = b, and then count the number of times 
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FIGURE 13 


m k that our result is a number between x k _ t and x k . When this way of arrang¬ 
ing the data is represented by a step function whose height is m k over the /cth 
subinterval, the resulting graph is called a histogram. 

In Figure 13 the birth weight data in the table on the left—taken from gen¬ 
uine vital statistics—is displayed in the histogram on the right. The total 
number of babies born in this hospital in this year was 2555. To find the aver¬ 
age birth weight directly, we would have to calculate the number 


sum of all birth weights 
total number of babies 


But our table doesn't provide individual birth weights, so without access to 
the original data this calculation is beyond our power. However, by using 
the midpoint of each weight interval, we find that the sum of all the birth 
weights is approximately 

(1.5) (12)+ (2.5) (18)+ (3.5) (46)+ (4.5) (158) 

+ (5.5) (422) + (6.5) (828) + (7.5)(49l) + (8.5)(429) 

+ (9.5)(133) + (10.5)(18) = 17,419.5 lb. (1) 


The average birth weight, also called the mean, is therefore approximately 
17,419.5/2555 = 6.82 lb. In the histogram each term of the sum (1) is the product 
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of x-coordinate of the midpoint of a subinterval and the area of the corre¬ 
sponding rectangle. 

If we reconstruct our histogram by using a larger and larger number of 
smaller and smaller subintervals, we expect the graph to approach the graph 
of a smooth function f(x). We can now adjust the unit of length along the 
vertical axis so that the total area under the curve is 1. This gives a func¬ 
tion y=f(x) called the frequency density. This function has two characteristic 
properties: 


b 

f(x) > 0 and j* f(x) dx = 1. (2) 


Also, if a < c < d < b, then the integral 

d 

J* f(x) dx (3) 

C 

gives the ratio of number of times the measurement produces a value between 
c and d to the total number of measurements, that is, the relative frequency of 
the result c < x < d. In the same way, fix) dx can be thought of as the proportion 
of results that lie between x and x + dx. From this point of view, the integral 
(3) can be interpreted as the probability that a randomly chosen measure¬ 
ment will have a result between c and d, and /(x) is then called a probability 
density function. 

In order to gain further insight into these concepts, let us for a moment 
think of/(x) as the mass density function of a rod of total mass 1 that lies 
along the x-axis between x=a and x = b. Then/(x) dx is the element of mass, 
x/(x) dx is the moment of this element of mass about the origin, and the 
integral 


x = 



dx 


(4) 


is the center of mass of the rod, since 



dx = 1. Also, the integral 


b 

I = j*(x-x) 2 /(x) dx 


(5) 
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is the moment of inertia of the rod about the line x - x as axis. We know from 
our experience in studying calculus that this quantity is small if most ele¬ 
ments of mass are nestled close to the axis and larger otherwise. 

In the case of a general probability density/(x) with properties (2), the inte¬ 
gral corresponding to (4), 


m = 



dx, 


is called the mean. As we know, the mean m is the point on the x-axis where 
the region under the probability density graph, if it were made out of card¬ 
board and placed in a horizontal position, would balance on the line x-m. 
The square root of the integral corresponding to (5), 



is called the standard deviation. If o is small, the results of our measurements 
cluster around the mean m ; and if o is large, then a significant portion of 
these results are farther away from m. 

In the general mathematical theory of probability of which these ideas are 
only a hint, it is customary to consider probability densities that are defined 
for all x, so that no limitations are placed on the possible results of the mea¬ 
surement or experiment under consideration. A probability density is then 
defined to be any function that satisfies the conditions 

oo 

f(x) > 0 and J* f(x) dx = 1, (6) 

—oo 

and the mean m and standard deviation a are defined by 

00 00 

777 = I xf(x)dx and o 2 = I (x-m) 2 f(x)dx. (7) 


Of course, these integrals are improper integrals in the sense discussed in 
calculus courses. 


Several Important Improper Integrals. To reach our goal of understanding 
the normal distribution we must first consider several properties of the func¬ 
tion y = f(x) = e ', whose bell-shaped graph is sketched in Figure 14. We 
begin by pointing out that this function is even, which means that f(-x) =/(x), 
so the graph is symmetric about the y-axis. Also, the values of the function 
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FIGURE 14 


are all positive, it has a maximum y = 1 at x = 0, and the graph has two points 
of inflection at x = ± \ -Jl (check this by calculating y"). It is clear that 

lim e~ x * = 0, (8) 

X-»±cO 


because e x =l!e x and e x -4<»asi- 


± oo. Also 


lim xe x =0, (9) 

X—»±oo 


because for 


x\ >1 we have 



x e 


2 - 
< x e 


and we know that 


lim M±0= x 2 e 1 = lim z _ >o0 ze 2 = 0. 

2 

It is a remarkable fact that the area under the curve y = e x has the finite 
value 


oo 



—oo 


because 


oo 



0 


This astonishing formula connecting e and tt is best established by using 
double integration in polar coordinates. To understand this, write 


1 = 


oo 



0 
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Since it doesn't matter what letter we use for the variable of integration, we 
have 


1 2 




By moving the first factor past the second integral sign, this can be written 
in the form 


I 2 


1 


o vo y 



00 00 

= JJe-^ +!/2> dxdy. 
0 0 


This double integral is extended over the entire first quadrant of the xy- 
plane. In polar coordinates it becomes 


I 2 





o 


n 

4' 


so I = 



which is (11), 


Next, since the integrand is an odd function, which means that f(-x)--f(x) 
it is clear that 


1 


xe X ~dx = 0. 


( 12 ) 


Finally, an integration by parts with u=x, dv = xe 1 dx gives 


f 2 -x 2 j 1 —A If -X 2 J 

\xe dx = —xe +— \e dx. 

J 2 2 J 


so 


jx 2 e X ~dx = --^te r + ^dx. 

0 0 
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By (9) and (11) we now have 


00 t 

I x 2 e x dx = lim x 2 e~ x dx 

J *-><*> J 

0 0 


1,-^.,. 1 


= lim — te + lim — | e ' dx 

f->°° l 2 ) f->°° 2, 




= 0 + 


oo 

1 f e 1 dx = — yfn. 
: J 4 


Since the integrand x ’e x is an even function, we conclude that 


oo 



—oo 


The Normal Curve. Let m be any number and a any positive number. Then 
the function 


/(*) = 


1 p -(x-mflla 2 

Oa/2 n 


(14) 


is called the normal probability density function with mean m and standard 
deviation o. Since clearly/(x) > 0 for all x, to verify what is implicitly stated 
here we must show that 


J* f(x)dx = 1. 


(15) 



(16) 


and 


J* (x - m) 2 f(x)dx 


= a 


—oo 


(17) 
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To prove these facts we use the change of variable t = (x- wz)/cj V2, so that t 
varies from to “ as i varies from to °° and 


x = m + ajlt, dx = a^Jldt, 



By using (10), (12), and (13) we establish (15), (16), and (17) as follows: 


j* f(x)dx = 



oo 



—oo 


and 


oo oo 



—oo —oo 


00 

J* (x-m) 2 f(x)dx = 

—oo 



2 G 2 f e 1 a^fldt 


—oo 



—oo 


The graph of (14) is called the normal curve with mean m and standard devia¬ 
tion a. It is symmetric about the line x = in, because the function (14) has the 
same values for x 1 - m + a and x 2 =m-a. Also, the curve is bell shaped, and the 
function assumes its maximum value of 1/a Vzii = 0.399/a at x=m. Further, 
the curve has two points of inflection at the points x-m + a and x-m-a. To 
see this we calculate 


and 


/'(*) = 


% m -(x-m) 2 /la 2 


_ 1 -(x-m) 2 / 2 a 2 (X rri) -p-m) 2 / 2 ct 2 

a 3 ^ G 5 ^ 


a 3 V2n 


x-m 


-1 


-(x-m) 2 jla 2 


/'(*) = 
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This formula tells us that the second derivative is positive for | x - m \ > a 
and negative for \x - m\ > a, which proves the statement about points of 
inflection. 

Normal curves with o = 1 and m = 0,2, -2 are shown on the left in Figure 15, 
and with m = 0 and a = 1/2,1,2 on the right. We observe that these curves are 
wide and flat for large o, and narrow and peaked for small a. For the special 
case in which m = 0 and o = l, we obtain the important standard normal prob¬ 
ability density 



(18) 


The graph of this function is shown in Figure 16. We notice that for -1 < x 
< 1 (within one standard deviation of the mean) we obtain 68.2 percent of 
the area under the curve, and for -2 < x < 2 (within two standard devia¬ 
tions of the mean) we obtain 95.4 percent of the area under the curve. It is 


/(*) 



m = — 2 m- 0 m = 2 



-2 0 2 


x 


-2 0 2 


x 


o fixed (a = 1) 


m fixed (m = 0) 


FIGURE 15 


Changes in f(x) as m varies and as o varies. 



0.4 



x 


-3 -2 -1 0 


1 2 


3 


FIGURE 16 

The standard normal curve (m = 0, o = 1). 
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an interesting fact that these percentages hold for the areas under all normal 
curves within one or two standard deviations of the mean. 

When/(x) is any probability density, the function of t defined by 



is called its distribution function. According to our previous interpretation, 
F(t) is the probability that x lies in the interval (-«>, f]. In particular, the normal 
distribution function (or simply the normal distribution) with mean m and stan¬ 
dard deviation a is the function 


F(f) = —j= f e- (x ~ mfn ^dx. 

cjhi J 


(19) 


In the simplest special case, in which m = 0 and cr = 1, it is customary to denote 
this by 


3>(f) = 


t 



—oo 


( 20 ) 


and to refer to it as the standard normal distribution. Tables have been con¬ 
structed for the function <E>(f) by the methods of numerical integration, and 
these tables can be used to solve many problems in science and mathemat¬ 
ics involving probability and statistics. Students who wish to explore these 
important ideas are urged to take an advanced course on mathematical 
probability. 

We have hinted at a procedure here, and it might be helpful to give a brief 
explanation of how this procedure works. To say that the quantity x is nor¬ 
mally distributed means that its density function is well approximated by (14) 
for suitable choices of m and a. The probability that x lies in the interval a < x 
<b is denoted by P(a < x < b) and is given by 

b 

P(a <x<b)~ ]_ f e- (x ~ mf/2,,Z dx. (21) 

af2n J 


If we make the substitution t = (x - m)/a, then a and b become 

b-m 


, a-m 
a =- and b = ■ 
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and the integral just written is transformed into 


P(a <x< b)= P(a' <t<h') = 


v 





This quantity can now be calculated by using tables to look up the numerical 
values of <E>(b') and <&(«')■ 

Many phenomena in science and society are normally distributed, and can 
therefore be modeled and calculated by using this machinery—for instance, 
the heights of men of the same age in a large population, the speeds of mol¬ 
ecules in a gas, the results of measuring a physical quantity many times, and 
so on. 


Example 1. The mean annual rainfall in New York City is 42 in. The 
annual rainfall over many years is closely approximated by the normal 
density function with m = 42 and standard deviation o = 2, 


m 


-(i-42) 2 /8 

2-fln 


A sketch of this normal curve is shown in Figure 17. Use this information 
to compute the proportion of years with rainfall between (a) 40 and 44 
in; (b) 38 and 46 in. 

Solution (a) The proportion of years with rainfall between 40 and 44 in is 


44 




FIGURE 17 
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With the change of variable t=(x - 42)/2—and access to table of values 
of €>(f)—this becomes 


1 


sfhi 


1 

jV f2/2 df = ®(l)-®(-l) 

-l 


= 0.8413-0.1587 s 0.6826. 


(b) Similarly, the proportion of years with rainfall between 38 and 46 in 
is (with the same change of variable) 



= ®(2)-<D(-2) 


= 0.9772 - 0.0228 = 0.9544. 


Example 2. An examination in school is sometimes considered to have 
done its job of spreading student grades fairly if the frequency histo¬ 
gram of grades can be approximated by a normal density function. Some 
teachers who go to this trouble then use this histogram and approximat¬ 
ing curve to estimate m and g, and assign the letter grade A to grades 
greater than m + g, B to grades between m and m + o, C to grades between 
m - g and m, D to grades between m - 2g and m - g, and F to grades 
below m - 2g. This is what is meant (or used to be meant) by grading on 
the curve. This approach to calculating grades is probably almost extinct 
in the modern era of grade inflation. 


The Differential Equation. How does it happen that these probability dis¬ 
cussions are saturated with various forms of the function e~ x ? We attempt 
to answer this question by showing how the normal probability density 
function (14) can be derived from simple and reasonable assumptions 
leading to a differential equation. 

Consider the experiment of a marksman repeatedly shooting at a target 
whose bull's eye is the origin of the xy-plane (Figure 18), and suppose that 
we are only interested in the x-coord i nates of the points of impact. These 
x-coordinates provide an ideally simple example of quantities distributed in 
the pattern we wish to examine, being bunched together around x = 0 and 
tapering off symmetrically to the sides. 

If f(x) is the probability density function of these x-coordinates, then /(x) 
dx is the probability for any particular shot that its x-coordinate lies in the 
interval from x to x + dx. Similarly the probability of the y-coordinate lying in 
the interval from y to y + dy is g(y) dxj, where g(y) is the probability density in 
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FIGURE 18 


the y-direction. Now, assuming that the x- and y-deviations from the bull's 
eye are independent of each other, then the product of the two probabilities, 

[fix) dx][g(y) dy\=f(x)g(y) dx dy=f(x)g(y) dA, 

is the probability that the bullet hits the element of area dA shown in the 
figure. Assuming further that the experiment possesses circular symmetry, 
this probability will be the same for any equal element of area at the same 
distance r in any direction from the bull's eye. This amount to assuming that 
f(x)g(y) is a function only of r 2 , 

f(x)g(y)=h(rf, (22) 

where r 2 -x 2 + y 2 . 

Differentiating both sides of (22) first with respect to x and then with 
respect to y gives 

f(x)g(y)=h'(r 2 )' 2* and f{x)g'{y) = h'{r 2 ) ■ 2y. 

By eliminating h'{r 2 ) from these equation we obtain 


mg(y) _ f(x)g'(y) 
2x 2y 


or 

m _ g\y) 

2 xf(x) 2 yg(y)' 


( 23 ) 
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Since the left side is a function of x alone, and the right side is a function of y 
alone, (23) implies that both sides are constant, in particular 

=c or m = 2cx, 


2 xf(x) f{x) 

and this is our desired differential equation. Integration now gives 

log f{x) -cx 2 + d 


or 


fix) = e d -e cx = De c: 


(24) 


where D=e d . But/(x) is a probability density function, so we must have 


j* fix) dx 


= 1 , 


(25) 


and this implies that c must be negative. We are free to write c in the form 
c-- l/2a 2 for a positive constant o, and (24) now becomes 

fix) = De-* 2 ' 2 * 2 . 


By integrating this from to °°, changing the variable of integration from x 
to t = xjc\/2 , and using (10) and (25), we obtain 


dJ e^ 2/2a2 dx = DoV2 J 


e 1 dt = DctV2 Vn = 1. 


—oo 


—00 


Therefore D = l/ a jin and our function takes its final form. 


/(*) = — 


1 

a*j2n 


which is the normal probability density (14) with mean m = 0. 





Chapter 2 

First Order Equations 


7 Homogeneous Equations 

Generally speaking, it is very difficult to solve first order differential equa¬ 
tions. Even the apparently simple equation 


dy 

dx 


f{x,y) 


cannot be solved in general, in the sense that no formulas exist for obtaining 
its solution in all cases. On the other hand, there are certain standard types 
of first order equations for which routine methods of solution are available. 
In this chapter we shall briefly discuss a few of the types that have many 
applications. Since our main purpose is to acquire technical facility, we shall 
completely disregard questions of continuity, differentiability, the possible 
vanishing of divisors, and so on. The relevant problems of a purely math¬ 
ematical nature will be dealt with later, when some of the necessary back¬ 
ground has been developed. 

The simplest of the standard types is that in which the variables are 
separable: 


~j~ = g{x)h{y). 
dx 


As we know, to solve this we have only to write it in the separated form 
dy/h(y) =g(x ) dx and integrate: 




dx + c. 


We have seen many examples of this procedure in the preceding chapter. 
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At the next level of complexity is the homogeneous equation. A function 
fix,\j) is called homogeneous of degree n if 

f(tx, ty) = i n f{x,y) 

for all suitably restricted x, y, and t. This means that if x and y are replaced 
by tx and ty, t n factors out of the resulting function, and the remaining factor 

is the original function. Thus x 2 + xy, ^jx 2 +t/ 2 , and sin (x/y) are homogeneous 
of degrees 2,1, and 0. The differential equation 

M (x, y) dx + N(x, y) dy = 0 

is said to be homogeneous if M and N are homogeneous functions of the same 
degree. This equation can then be written in the form 

(i) 

dx 

where/(x, y)-~ M(x, y)/N(x, y) is clearly homogeneous of degree 0. The proce¬ 
dure for solving (1) rests on the fact that it can alivays be changed into an equa¬ 
tion with separable variables by means of the substitution z=y/x, regardless 
of the form of the function/(x, ty). To see this, we note that the relation 

/(fx, ty) - t°f(x, y) =/(x, y) 

permits us to set f = 1/x and obtain 

f{x, y) =/(l, y/x) =/(l, z). 


Then, since y-zx and 


dy_ 

dx 


z + x 


dz 

dx' 


( 2 ) 


equation (1) becomes 


z + x^ = /(l,z), 
dx 

and the variables can be separated: 

dz _ dx 
f (1, z)—z X ' 

We now complete the solution by integrating and replacing z by y/x. 
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Example 1. Solve (x+y) dx - (x - y) dy = 0. 

We begin by writing the equation in the form suggested by the above 
discussion: 


dy x + y 
dx x-y 

Since the function on the right is clearly homogeneous of degree 0, 
we know that it can be expressed as a function of z=y/x. This is easily 
accomplished by dividing numerator and denominator by x: 

dy _ 1 + y/x _ 1 + z 
dx 1 -y/x 1-z 

We next introduce equation (2) and separate the variables, which gives 

(l-z)dz _ dx 
1 + z 2 ~ x ' 


On integration this yields 


tan 2 z-—log(.l + z 2 ) = logx + c; 


and when z is replaced by y/x, we obtain 

^ y -loU 


tan 


x 2 +y 2 +c 


as the desired solution. 


Problems 

1. Verify that the following equations are homogeneous, and solve them: 

(a) (x 2 - 2y 2 ) dx + xy dy=0 ; 

(b) x 2 y' - 3 xy - 2y 2 = 0; 

(c) x 2 y = 3(x 2 + y 2 )tan _1 — + xy, 


-j, . y dy . y 

(d) xsin — — = y sm- + x. 

x dx x ’ 

(e) xy'=y+2xe~y /x ; 

(f) (x - y) dx - (x + y) dy = 0; 
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(g) xy' = 2x + 3y ; 

(h) xy' = yjx 2 + y 2 ; 

(i) x 2 y' =y 2 + 2xy, 

(j) (x 3 +y 3 )dx-xifdy = 0. 

2. Use rectangular coordinates to find the orthogonal trajectories of the 
family of all circles tangent to the y-axis at the origin. 

3. Show that the substitution z-ax + by + c changes 

y'=f(ax + by+c) 

into an equation with separable variables, and apply this method to 
solve the following equations: 

(a) y' = (x + y) 2 ; 

(b) y' = sin 2 (x -y + 1). 

4. (a) If ae * bd, show that constants h and k can be chosen in such a way 

that the substitutions x = z -h,y = w -k reduce 

dy ax + by + c' 
dx ydx+ey + f 


to a homogeneous equation. 

(b) If ae = bd, discover a substitution that reduces the equation in (a) to 
one in which the variables are separable. 

5. Solve the following equations: 

(a) dy _ x + y + 4 , 


dx x - 


y- 


( b ) dy _ x + y + 4 

dx x + y-6' 

(c) (2x - 2y) dx + (y - 1) dy = 0; 

(d) dy _ x + y-i . 
dx x + 4y + 2' 

(e) (2.x + 3y - 1) dx - 4(x +1 ) dy- 0. 

6. By making the substitution z-y/x n or y = zx n and choosing a conve¬ 
nient value of n, show that the following differential equations can be 
transformed into equations with separable variables, and thereby solve 
them: 


(a) 


dy 1 -xy 2 
dx 2x 2 y 
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(b) 

(c) 


dy 2 + 3xy 2 _ 
dx 4 x 2 y 

d v y~ x v 2 

dx x + x 2 y 


7. Show that a straight line through the origin intersects all integral 
curves of a homogeneous equation at the same angle. 

8 . Let y' =f(x, y) be a homogeneous differential equation, and prove the 
following geometric fact about its family of integral curves: If the xy- 
plane is stretched from (or contracted toward) the origin in such a way 
that each point (x, y) is moved to a new point (x u y,) which is k times 
its original distance from the origin, with its direction from the ori¬ 
gin unchanged, then every integral curve C is carried into an integral 
curve Cj. Hint: x x = kx and y l - ky. 

9. Let y' =f(x, y) be a differential equation whose family of integral curves 
has the geometric property of invariance under stretching which is 
stated in Problem 8, and prove that the equation is homogeneous. 

10. Let a family of curves be integral curves of a differential equation 
y' -fix, y). Let a second family have the property that at each point 
P = (x, y) the angle from the curve of the first family through P to the 
curve of the second family through P is a. Show that the curves of the 
second family are solutions of the differential equation 


f(x,y) + tana 
J l-/(x,y)tana 


11. Use the result of the preceding problem to find the curves that form the 
angle tt/ 4 with 

(a) all straight lines through the origin; 

(b) all circles x 2 + y 2 = c 2 ; 

(c) all hyperbolas x 2 - 2 xy -y 2 =c. 


8 Exact Equations 

If we start with a family of curves/(x, y) = c, then its differential equation can 
be written in the form df - 0 or 


df , 
^-dx + 
dx 


df 

dy 


dy = 0. 
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For example, the family x 1 2 y 3 -c has 2xy 3 dx + 3x 2 y 2 dy = 0 as its differential 
equation. Suppose we turn this situation around, and begin with the dif¬ 
ferential equation 


( 1 ) 

( 2 ) 

then (1) can be written in the form 

—dx + —dy=0 or df = 0 

dx dy J J 

and its general solution is 


M(x, y) dx + N(x, y) dy = 0. 
If there happens to exist a function/(x, y) such that 

— = M and $- = N, 
dx dy 


fix, y) = c. 

In this case the expression M dx + N dy is said to be an exact differential, and 
(1) is called an exact differential equation. 

It is sometimes possible to determine exactness and find the function/by 
mere inspection. Thus the left sides of 


1 X 

y dx + x dy = 0 and — dx - —=■ dy = 0 

y v 

are recognizable as the differentials of xy and x/y, respectively, so the general 
solutions of these equations are xy-c and x/y-c. In all but the simplest cases, 
however, this technique of "solution by insight" is clearly impractical. What 
is needed is a test for exactness and a method for finding the function/ We 
develop this test and method as follows. 

Suppose that (1) is exact, so that there exists a function/satisfying equa¬ 
tions (2). We know from elementary calculus that the mixed second partial 
derivatives of/ are equal: 


d 2 f _ d 2 f } 

dy dx dx dy 


1 The reader should be aware that equation (3) is true whenever both sides exist and are contin¬ 
uous, and that these conditions are satisfied by almost all functions that are likely to arise in 

practice. Our blanket hypothesis throughout this chapter (see the first paragraph in Section 7) 

is that all the functions we discuss are sufficiently continuous and differentiable to guarantee 
the validity of the operations we perform on them. 
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This yields 


8 ^ = f’ ( 4 ) 

dy ox 

so (4) is a necessary condition for the exactness of (1). We shall prove that it 
is also sufficient by showing that (4) enables us to construct a function/that 
satisfies equations (2). We begin by integrating the first of equations (2) with 
respect to x: 

f=jMdx+g(y). (5) 

The "constant of integration" occurring here is an arbitrary function of y 
since it must disappear under differentiation with respect to x. This reduces 
our problem to that of finding a function g(y) with the property that / as 
given by (5) satisfies the second of equations (2). On differentiating (5) with 
respect to y and equating the result to N, we get 

^-Jm dx +g’(y) = N, 


so 


This yields 


g'(y) = N-^Mdx. 




( 6 ) 


provided the integrand here is a function only of y. This will be true if the 
derivative of the integrand with respect to x is 0; and since the derivative in 
question is 


d_ 

dx 


N-±[ 

dyi 


M dx 


dN 

d 2 f,.. 


-1 M dx 

dx 

dxdy J 

dN 

d 2 N.j 


-1 M dx 

dx 

dydxJ 

dN 

dM 

dx 

dy ' 


an appeal to our assumption (4) completes the argument. 
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In summary, we have proved the following statement: equation (1) is exact 
if and only if dM/dy = dN/dx; and in this case, its general solution is f(x, y)-c, 
where f is given by (5) and (6). Two points deserve emphasis: it is the equation 
f(x, y)~c, and not merely the function/ which is the general solution of (1); 
and it is the method embodied in (5) and (6), not the formulas themselves, 
which should be learned. 


Example 1. Test the equation e y dx+{xe y +2y) dy=0 for exactness, and 
solve it if it is exact. 

Here we have 


M=e y and N=xe y +2 y, 


so 


8M 


= e y and 


dN 

dx 


Thus condition (4) is satisfied, and the equation is exact. This tells us that 
there exists a function/(x, y) such that 

— = e v and — = xe il +2y. 
dx 8y 

Integrating the first of these equations with respect to x gives 
/ = J e y dx + g(y) = xe y + g(y), 


SO 


^- = xe y + g\y). 
dy 

Since this partial derivative must also equal xe y + 2y, we have g'{y) = 2y, 
so g(y)=y 2 and f=xe y +y 2 . All that remains is to note that 

xe y +y 2 =c 

is the desired solution of the given differential equation. 


Problems 

Determine which of the following equations are exact, and solve the ones 
that are. 
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1. 


' 2" 

x + — 

y 


dy + y dx = 0. 


2. (sin x tan y +1) dx + cos x sec 2 y dy = 0. 

3. (y - x 3 ) dx + (x + y 3 ) dy = 0. 

4. (2y 2 - 4x + 5) dx = (4 - 2y + 4xy) dy. 

5. (y + y cos xy) dx + (x + x cos xy) dy-0. 

6. cos x cos 2 y dx + 2 sin x sin y cos y dy = 0. 

7. (sin x sin y - xe ; ') dy = (e v + cos x cos y) dx. 

8. - — sin — dx + ^sin — dy = 0. 

y y y y 

9. (1 + y) dx + (1 - x) dy = 0. 

10. (2xy 3 + y cos x) dx + (3x 2 y 2 + sin x) dy = 0. 

y 


2 2 dx + — - 2 T dy ■ 

1 — x y 1-x y 


11. dx = 

12. (2xy 4 + sin y) dx + (4x 2 y 3 + x cos y) dy = 0. 

13. i^f$L + xdx = 0. 


14 


i-*y 

. 2x |l + ^x 2 - y j dx = y]x 2 -y dy. 

15. (x log y + xy) dx + (y log x + xy) dy = 0. 

16. (e ;/ -cscycsc 2 x)dx + (2xye ;/ - esc y cot y cot x) dy = 0. 

17. (1 + y 2 sin 2x) dx - 2y cos 2 x dy = 0. 

x dx y dy 


18. 


(x 2 + y 2 ) 3/2 (* 2 +y 2 ) 3/2 


= 0 . 


19. 3x 2 (l +log y)dx + 

20. Solve 


f 3 
X 


2 y 
y j 


dy = 0. 
y dx - x dy 

(*+y) : 


2 + dy = dx 


as an exact equation in two ways, and reconcile the results. 
21. Solve 


4y -2x _ 8y -x" 

y 2 -j-dx + —— -— dy = 0 


4xy 


4y -x y 


(a) as an exact equation; 

(b) as a homogeneous equation. 
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22. Find the value of n for which each of the following equations is exact, 
and solve the equation for that value of n: 

(a) ( xy 2 + nx 2 y) dx + (x 3 + x 2 y) dy = 0; 

(b) (x + ye 2xy ) dx + nxe 2xy dy = 0. 


9 Integrating Factors 

The reader has probably noticed that exact differential equations are com¬ 
paratively rare, for exactness depends on a precise balance in the form of the 
equation and is easily destroyed by minor changes in this form. Under these 
circumstances, it is reasonable to ask whether exact equations are worth dis¬ 
cussing at all. In the present section we shall try to convince the reader that 
they are. 

The equation 


y dx + ( x 2 y - x) dy-0 (1) 

is easily seen to be nonexact, for dM/dy = 1 and dN/dx = 2xy - 1. Ffowever, if 
we multiply through by the factor 1/x 2 , the equation becomes 


^rdX + 
X 




dy = 0, 


which is exact. To what extent can other nonexact equations be made exact in 
this way? In other words, if 

M(x, y) dx + N(x, y) dy = 0 (2) 

is not exact, under what conditions can a function p(x, y) be found with the 
property that 


p(M dx + N dy )-0 

is exact? Any function p that acts in this way is called an integrating factor for 
(2). Thus 1/x 2 is an integrating factor for (1). We shall prove that (2) always has 
an integrating factor if it has a general solution. 

Assume then that (2) has a general solution 


/(x, y) = c, 
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and eliminate c by differentiating: 

3~dx+^dy = 0. (3) 

dx dy 

It follows from (2) and (3) that 

dy _ M _ df/dx 
dx N df/dy' 


so 


df / dx _ df /dy 
M ~ N 

If we denote the common ratio in (4) by g (x, y), then 

— = gM and — = gN. 

dx dy 


On multiplying (2) by g, it becomes 

gM dx + g N dy = 0 


( 4 ) 


or 


df , 
dx + 
dx 


f dl J = °' 
dy 


which is exact. This argument shows that if (2) has a general solution, then it 
has at least one integrating factor g. Actually it has infinitely many integrat¬ 
ing factors; for if F(f) is any function of f then 

g F(f)(M dx + N dy) = F(f)df = d [J F(f) df , 


so g F(f) is also an integrating factor for (2). 

Our discussion so far has not considered the practical problem of finding 
integrating factors. In general this is quite difficult. There are a few cases, 
however, in which formal procedures are available. To see how these proce¬ 
dures arise, we consider the condition that g be an integrating factor for (2): 

8 (gM) 5(gN) 

dy dx 
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If we write this out, we obtain 



or 




( 5 ) 


jj. ^ dx dy J dy dx 


It appears that we have "reduced" the problem of solving the ordinary dif¬ 
ferential equation (2) to the much more difficult problem of solving the par¬ 
tial differential equation (5). On the other hand, we have no need for the 
general solution of (5) since any particular solution will serve our purpose. 
And from this point of view, (5) is more fruitful than it looks. Suppose, for 
instance, that (2) has an integrating factor p which is a function of x alone. 
Then dp/dx=dp/dx and dp/dy = 0, so (5) can be written in the form 


1 du dM/dy-dN/dx 
p dx N 


( 6 ) 


Since the left side of this is a function only of x, the right side is also. If we put 


DM/dy-dN/dx 


then (6) becomes 



or 




so 


logb= g{x)dx 


and 


p = e- 


(7) 
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This reasoning is obviously reversible: if the expression on the right side of (6) 
is a function only of x, say g(x), then (7) yields a function p that depends only 
on x and satisfies equation (5), and is therefore an integrating factor for (2). 


Example 1. In the case of equation (1) we have 


dM/dy-dN/dx _ \-(2xy-\) _ -2(xy-l) _ _ 2 
N x 2 y - x x(xy - 1) x' 


which is a function only of x. Accordingly, 

= gJ-(2/x) dx — e -2 log x — y-2 

is an integrating factor for (1), as we have already seen. 


Similar reasoning gives the following related procedure, which is appli¬ 
cable whenever (2) has an integrating factor depending only on y. if the 
expression 


dM/dy- dN/dx 
-M 


( 8 ) 


is a function of y alone, say h(y), then 


( 9 ) 


is also a function only of y which satisfies equation (5), and is consequently 
an integrating factor for (2). 

There is another useful technique for converting simple nonexact equa¬ 
tions into exact ones. To illustrate it, we again consider equation (1), rear¬ 
ranged as follows: 


x 2 y dy - {xdy -y dx) = 0. (10) 

The quantity in parentheses should remind the reader of the differential 
formula 



xdy-ydx 


d 


( 11 ) 







78 


Differential Equations with Applications and Historical Notes 


which suggests dividing (10) through by x 2 . This transforms the equation 
into y dy - d(y/x) = 0, so its general solution is evidently 


In effect, we have found an integrating factor for (1) by noticing in it the 
combination x dy - y dx and using (11) to exploit this observation. The fol¬ 
lowing are some other differential formulas that are often useful in similar 
circumstances: 


d 


r \ 
x 

y 


ydx-xdy 
~ 2 ' 
y 


d(xy)-x dy+y dx-, 
d(x 2 + y 2 ) = 2(x dx + y dy); 


d 


tan 


4 X 

y 


ydx-xdy 

2 2 ' 
x +y 


( 12 ) 

(13) 

(14) 

(15) 


f \ 

log- 

v y 


y dx-xdy 
xy 


(16) 


We see from these formulas that the very simple differential equation y dx - 
x dy = 0 has 1/x 2 ,1/y 2 , l/(x 2 + y 2 ), and 1 /xy as integrating factors, and thus can 
be solved in this manner in a variety of ways. 


Example 2. Find the shape of a curved mirror such that light from a 
source at the origin will be reflected in a beam of rays parallel to the 
x-axis. 

By symmetry, the mirror will have the shape of the surface of revolu¬ 
tion generated by revolving a curve APB (Figure 19) about the x-axis. 

It follows from the law of reflection that a = 2p. By the geometry of the 
situation, 4> = P and 0 = a + 4> = 2p. Since tan 0 = y/x and 


2tanp 
1 - tan 2 p ' 


tan0 = tan2p 
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FIGURE 19 


we have 


y 2dy/dx 

x 1 -(dy/dx) 2 


Solving this quadratic equation for dy/dx gives 


dy _ -x± y jx 2 + xj 2 
dx y 


or 


x dx +y dy = ± ^x 2 + y 2 dx. 


By using (14), we get 


± d(x 2 +y 2 ) 
2-y/x 2 +y 2 


= dx, 


so 


±Jx 2 + y 2 =x + c. 
On simplification this yields 


y 2 =2cx+c 2 , 

which is the equation of the family of all parabolas with focus at the ori¬ 
gin and axis the x-axis. It is often shown in elementary calculus that all 
parabolas have this so-called focal property. The conclusion of this exam¬ 
ple is the converse: parabolas are the only curves with this property. 
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Problems 


1. Show that if ( dM/dy - dN/dx)/(Ny - Mx ) is a function g(z) of the product 
z-xy, then 


p-el h z ) 


is an integrating factor for equation (2). 

2. Solve each of the following equations by finding an integrating factor: 

(a) (3x 2 - y 2 ) dy - 2xy dx = 0; 

(b) (xy - 1) dx + (x 2 - xy) dy = 0; 

(c) x dy + y dx + 3x 3 ]f dy = 0; 

(d) e x dx + ( e x cot y+2y esc y) dy = 0; 

(e) (x + 2) sin y dx + x cos y dy = 0; 

(f) y dx + (x - 2x 2 y 3 ) dy = 0; 

(g) (x + 3 y 2 ) dx + 2 xy dy = 0; 

(h) y dx + (2x - ye'J) dy = 0; 

(i) (y log y ~ 2 xy) dx + (x + y) dy = 0; 

(j) (y 2 +xy + l) dx + (x 2 + xy+l) dy = Q; 

(k) (x 3 + xy 3 ) dx + 3y 2 dy = 0. 

3. Under what circumstances will equation (2) have an integrating factor 
that is a function of the sum z = x + y? 

4. Solve the following equations by using the differential formulas 
(12)—(16): 

(a) xdy-y dx = (l+y 2 ) dy; 

(b) y dx - x dy-xy 3 dy, 

(c) x dy = (x 5 + x 3 y 2 + y) dx; 

(d) (y + x) dy=(y - x) dx; 

(e) x dy- (y + x 2 + 9y 2 ) dx; 

(f) (y 2 - y) dx + x dy = 0; 

(g) x dy - y dx = (2x 2 - 3) dx; 

(h) x dy + y dx - Jxy dy; 

(i) (y - xy 2 ) dx + (x + x 2 y 2 ) dy = 0; 

(j) x dy -y dx- x 2 y 4 (x dy + y dx); 

(k) xdy+y dx + x 2 y 5 dy = 0; 

(l) (2xy 2 -y) dx + x dy- 0; 

(m) dy + — dx = sinx dx. 
x 
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5. Solve the following equation by making the substitution z-y/x" or 
y-x n z and choosing a convenient value for n: 


dy 2i/ x 3 u 

— = + — + x tan -Ar. 

dx x y x~ 


6. Find the curve APB in Example 2 by using polar coordinates instead of 
rectangular coordinates. Hint: \|/ + « = tl 


10 Linear Equations 

The most important type of differential equation is the linear equation, 
in which the derivative of highest order is a linear function of the lower order 
derivatives. Thus the general first order linear equation is 

= p(x)y + q(x), 
dx 


the general second order linear equation is 


^= p (*) d l. + vWy + r (*)' 


dx 


dx 


and so on. It is understood that the coefficients on the right in these expres¬ 
sions, namely, p(x), q(x), r(x), etc., are functions of x alone. 

Our present concern is with the general first order linear equation, which 
we write in the standard form 


dy 

dx 


+ P(x)y = Q(x). 


( 1 ) 


The simplest method of solving this depends on the observation that 

d ( f Pdx 1 [Pdxdy „ f Pdx f P dx ( dll ^ 

ixl e y ) e dx +yPe =e \jk +Py \ (2) 

Accordingly, if (1) is multiplied through by e' p dx , it becomes 


±(J 

dx 


V = Qe 


( 3 ) 
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Integration now yields 

\p dx C f P ix , 

e * 1 y - I Qe 1 dx + c, 


so 




( 4 ) 


is the general solution of (1). 


Example 1. Solve — + — y = 3x. 

dx x 

This equation is obviously linear with P = l/x, so we have 

jP dx = j—dx = logx and e^ F ‘ h = e l ° sx = x. 

On multiplying through by x and remembering (3), we obtain 

d 


dx 


(xy)= 3x 


so 

xy=x 3 + c or y=x 2 +cx~ 1 . 

As the method of this example indicates, one should not try to learn the 
complicated formula (4) and apply it mechanically in solving linear equa¬ 
tions. Instead, it is much better to remember and use the procedure by which 
(4) was derived: multiply by el p dx and integrate. One drawback to the above 
discussion is that everything hinges on noticing the fact stated in (2). In other 
words, the integrating factor e- ,,J dx seems to have been plucked mysteriously 
out of thin air. In Problem 1 below we ask the reader to discover it for himself 
by the methods of Section 9. 


Problems 

1. Write equation (1) in the form M dx + N dy = 0 and use the ideas of 

Section 9 to show that this equation has an integrating factor p that is a 

function of x alone. Find u and obtain (4) by solving pM dx + p N dy = 0 
as an exact equation. 
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2. Solve the following as linear equations: 

(a) X Y ~~ 3 y = * 4 ; 

dx 

(b > 

(c) (1 + x 2 ) dxj + 2 xy dx = cot x dx; 

(d) y' + y=2xe~ x +x 2 ; 

(e) y' +y cot x=2x esc x; 

(f) (2y - x 3 ) dx = x dy; 

(g) y - x + xy cot x + xy' = 0; 

(h) — - 2 xy = 6xe x ; 
dx 

(i) (x log x)y' +y-3x 3 ; 

(j) (y - 2 xy - x 2 ) dx + x 2 dy = 0. 

3. The equation 

~j~ + P( x )y - Q( x )y"> 

dx 

which is known as Bernoulli's equation, is linear when n- 0 or 1. Show 
that it can be reduced to a linear equation for any other value of n by the 
change of variable z-y l ~ n , and apply this method to solve the following 
equations: 

(a) xy'+y = xY; 

(b) xy 2 y' + y 3 -x cos x; 

(c) x dy+y dx = xy 2 dx. 

4. The usual notation dy/dx implies that x is the independent variable and 
y is the dependent variable. In trying to solve a differential equation, 
it is sometimes helpful to replace x by y and y by x and work on the 
resulting equation. Apply this method to the following equations: 

(a) (e-V-2 xy)y'=y 2 ; 

(b) y - xy' =y'y 2 e'J; 

(c) xy' + 2 = x 3 {y -\)y'; 

(d) M 2 ™+3f(y)f'(y)x = f'(y). 

5. Find the orthogonal trajectories of the family of curves 

(a) y=x + ce~ x ; 

(b) y 2 -ce x + x+ 1. 
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6. We know from (4) that the general solution of a first order linear equa¬ 
tion is a family of curves of the form 

y = cf{x)+g(x). 

Show, conversely, that the differential equation of any such family is 
linear. 

7. Show that y' + Py=Qy log y can be solved by the change of variable 
z = log y, and apply this method to solve xy' = 2 x 2 y + y log y. 

8. One solution of y' sin 2x = 2y + 2 cos x remains bounded as x -* n/2. 
Find it. 

9. A tank contains 10 gallons of brine in which 2 pounds of salt are dis¬ 
solved. Brine containing 1 pound of salt per gallon is pumped into the 
tank at the rate of 3 gallons/minute, and the stirred mixture is drained 
off at the rate of 4 gallons/minute. Find the amount x = x(f) of salt in the 
tank at any time t. 

10. A tank contains 40 gallons of pure water. Brine with 3 pounds of salt 
per gallon flows in at the rate of 2 gallons/minute, and the stirred mix¬ 
ture flows out at 3 gallons/minute. 

(a) Find the amount of salt in the tank when the brine in it has been 
reduced to 20 gallons. 

(b) When is the amount of salt in the tank largest? 

11. (a) Suppose that a given radioactive element A decomposes into a 

second radioactive element B, and that B in turn decomposes into 
a third element C. If the amount of A present initially is x 0 , if the 
amounts of A and B present at a later time t are x and y, respectively, 
and if /c, and k 2 are the rate constants of these two reactions, find y 
as a function of t. 

(b) Radon (with a half-life of 3.8 days) is an intensely radioactive gas 
that is produced as the immediate product of the decay of radium 
(with a half-life of 1600 years). The atmosphere contains traces of 
radon near the ground as a result of seepage from soil and rocks, all 
of which contain minute quantities of radium. There is concern in 
some parts of the American West about possibly dangerous accu¬ 
mulations of radon in the enclosed basements of houses whose 
concrete foundations and underlying ground contain apprecia¬ 
bly greater quantities of radium than normal because of nearby 
uranium mining. If the rate constants (fractional losses per unit 
time, in years) for the decay of radium and radon are k 1 = 0.00043 
and k 2 = 66, use the result of part (a) to determine how long after 
the completion of a basement the amount of radon will be at a 
maximum. 
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11 Reduction of Order 

As we have seen, the general second order differential equation has the form 

F(x, y, y', y") = 0. 

In this section we consider two special types of second order equations that 
can be solved by first order methods. 


Dependent variable missing. If y is not explicitly present, our equation can 
be written 


f(x, y', y") = 0. 


( 1 ) 


In this case we introduce a new dependent variable p by putting 


y' = p and 



( 2 ) 


This substitution transforms (1) into the first order equation 

0 =o - (3) 

If we can find a solution for (3), we can replace p in this solution by dy/dx and 
attempt to solve the result. This procedure reduces the problem of solving 
the second order equation (1) to that of solving two first order equations in 
succession. 


Example 1. Solve xy" - y' =3x 2 . 

The variable y is missing from this equation, so (2) reduces it to 


x 


dp 

dx 


-p = 3x 2 


dp 

dx 


1 , 

—p = 3x, 
x 


or 
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which is linear. On solving this by the method of Section 10, we obtain 

dy _ 2 

p = — = 3x + C\X, 
dx 


so 


3 1 2 

y = x + — C\X + C 2 


is the desired solution. 

Independent variable missing. If x is not explicitly present, our second 
order equation can be written 

g{y,y',y")= o. (4) 


Here we introduce our new dependent variable p in the same way, but this 
time we express y" in terms of a derivative with respect to y: 


y =1 


and 


y = 


dx 


dp dy 
dy dx 


dy 


( 5 ) 


This enables us to write (4) in the form 


g 


dp 


VrV’V , 
v d Vj 


= 0 ; 


( 6 ) 


and from this point on we proceed as above, solving two first order equa¬ 
tions in succession. 


Example 2. Solve y" +k 2 y= 0. 

With the aid of (5), we can write this in the form 

p— + k 2 y = 0 or p dp + k 2 y dy = 0. 

dy 


Integration yields 


p 2 + k 2 y 2 =k 2 a 2 , 
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so 

V = j x = ik^a * 1 2 - y 2 
or 



= ±kdx. 


A second integration gives 

sin -1 — = ±kx + b, 
a 


so 


y=a sin (±kx + b) or y=A sin (kx + B). 

This general solution can also be written as 

y=c 1 sin kx + c 2 cos kx, 

by expanding sin (kx+B) and changing the form of the constants. 


( 7 ) 


The equation solved in Example 2 occurs quite often in applications (see 
Section 5). It is linear, and its solution (7) will be fitted into the general theory 
of second order linear equations in the next chapter. 


Problems 

1. Solve the following equations: 

(a) yy" + (y') 2 -0; 

(b) xy” -y' + (y') 3 ; 

(c) y" - k 2 y = 0; 

(d) x 2 y" = 2xy’ + (y') 2 ; 

(e) 2yy" = 1 + (y') 2 ; 

(f) yy" - (y') 2 =0; 

(g) xy"+y' = 4x. 

2. Find the specified particular solution of each of the following equations: 

(a) (x 2 + 2y')y" + 2 xy' = 0, y = 1 and y' - 0 when x - 0; 

(b) yy" = y 2 y' + (y') 2 , y = - 1/2 and y' = 1 when x - 0; 

(c) y" = y'ev, y = 0 and y' = 2 when x = 0. 
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3. Solve each of the following equations by both methods of this section, 
and reconcile the results: 

(a) y" = 1 + (y') 2 ; 

(b) y" + (y') 2 =l. 

4. In Problem 5-8 we considered a hole drilled through the earth from 
pole to pole and a rock dropped into the hole. This rock will fall through 
the hole, pause at the other end, and return to its starting point. How 
long will this complete round trip take? 

5. Consider a wire bent into the shape of the cycloid whose parametric 
equations are x = a(Q - sin 0) and y = a(l - cos 0), and invert it as in Figure 
10. If a bead is released on the wire and slides without friction and 
under the influence of gravity alone, show that its velocity v satisfies 
the equation 


Aav 2 =g(so-s 2 ), 

where s 0 and s are the arc lengths from the bead's lowest point to the 
bead's initial position and its position at any later time, respectively. By 
differentiation obtain the equation 


and from this find s as a function of t and determine the period of the 
motion. Note that these results establish once again the tautochrone 
property of the cycloid discussed in Problem 6-5. 


12 The Hanging Chain. Pursuit Curves 

We now discuss several applications leading to differential equations that 
can be solved by the methods of this chapter. 


Example 1. Find the shape assumed by a flexible chain suspended 
between two points and hanging under its own weight. 

Let the y-axis pass through the lowest point of the chain (Figure 20), let 
s be the arc length from this point to a variable point (x, y), and let zv(s) be 
the linear density of the chain. We obtain the equation of the curve from 
the fact that the portion of the chain between the lowest point and (x, y) 
is in equilibrium under the action of three forces; the horizontal tension 
T 0 at the lowest point; the variable tension T at (x, y), which acts along 
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FIGURE 20 


the tangent because of the flexibility of the chain; and a downward force 
equal to the weight of the chain between these two points. Equating the 
horizontal component of T to T 0 and the vertical component of T to the 
weight of the chain gives 


T cos 0 = T 0 and TsinO = J zy(s) ds. 

o 


It follows from the first of these equations that 

T sin 0 = T 0 tan 0 = T O ^ ; 

dx 


so 


T 0 y' = j" iv(s)ds. 
0 


We eliminate the integral here by differentiating with respect to x\ 

s s 

T 0 y" =— f iv(s)ds = — f w(s)ds- 
J dx J dsJ i 

0 0 

= w(s)yjl + {y'f. 


ds 

dx 
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Thus 


T 0 y" = zv(s)jl + (y') 2 


(1) 


is the differential equation of the desired curve, and the curve itself is 
found by solving this equation. To proceed further, we must have defi¬ 
nite information about the function w(s). We shall solve (1) for the case in 
which w (s) is a constant w a so that 


V" = n^l + (y'?, 



( 2 ) 


On substituting y'=p and y" = dp/dx, as in Section 11, equation (2) 
reduces to 


dp 


= a 


dx. 


( 3 ) 


We now integrate (3) and use the fact that p = 0 when x = 0 to obtain 


log {p+yjuy)-- 


Solving for p yields 


P = 


dy_ 

dx 



e- x ). 


If we place the x-axis at the proper height, so that i/ = l/a when x = 0, 
we get 


y 


= — (e“ + e ax ) = —cosh ax 
2 a a 


as the equation of the curve assumed by a uniform flexible chain hang¬ 
ing under its own weight. This curve is called a catenary, from the Latin 
word for chain, catena. Catenaries also arise in other interesting prob¬ 
lems. For instance, it will be shown in Chapter 12 that if an arc joining 
two given points and lying above the x-axis is revolved about this axis, 
then the area of the resulting surface of revolution is smallest when the 
arc is part of a catenary. 
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Example 2. A point P is dragged along the ry-plane by a string PT of 
length a. If T starts at the origin and moves along the positive y-axis, and 
if P starts at (a, 0), what is the path of P ? This curve is called a tmctrix 
(from the Latin tractum, meaning drag). 

It is easy to see from Figure 21 that the differential equation of the 
path is 


dy _ y Ja 2 - x 2 
dx x 


On separating variables and integrating, and using the fact that y=0 
when x = a, we find that 


y = a log 


a + yfa 


- yja 2 -x 2 


is the equation of the tractrix. This curve is of considerable importance 
in geometry, because the trumpet-shaped surface obtained by revolving 
it about the y-axis is a model for Lobachevsky's version of non-Euclidean 
geometry, since the sum of the angles of any triangle drawn on the sur¬ 
face is less than 180°. Also, in the context of differential geometry this 
surface is called a pseudosphere, because it has constant negative curva¬ 
ture as opposed to the constant positive curvature of a sphere. 



FIGURE 21 
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Example 3. A rabbit starts at the origin and runs up the i/-axis with 
speed a. At the same time a dog, running with speed b, starts at the point 
(c, 0) and pursues the rabbit. What is the path of the dog? 

At time t, measured from the instant both start, the rabbit will be at the 
point R = (0, at) and the dog at D = ( x , y) (Figure 22). Since the line DR is 
tangent to the path, we have 


dy = y-at 

dx x 


or 


xy' -y = -at. 


( 4 ) 


To eliminate t, we begin by differentiating (4) with respect to x, which 
gives 


xy = -a 


dt 

dx 


( 5 ) 


Since ds/dt = b, we have 


dt dt els _L r. 2 

— =-= —Jl + (v) , 

dx ds dx b ’ 


( 6 ) 


where the minus sign appears because s increases as x decreases. When 
(5) and (6) are combined, we obtain the differential equation of the path: 


xy"=kyjl + (y') 2 , k=^. 


( 7 ) 



FIGURE 22 
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The substitution y' =p and y" =dp/dx reduces (7) to 

dp , dx 

V 1 + p 2 ” T ' 

and on integrating and using the initial condition p = 0 when x = c, we find 
that 


log (y+^i+r) 



This can readily be solved for p, yielding 


dx 2 ^ c J yxj 


In order to continue and find y as a function of x, we must have further 
information about k. We ask the reader to explore some of the possibili¬ 
ties in Problem 8. 


Example 4. The y-axis and the line x = c are the banks of a river whose 
current has uniform speed a in the negative y-direction. A boat enters 
the river at the point (c,0) and heads directly toward the origin with 
speed b relative to the water. What is the path of the boat? 

The components of the boat's velocity (Figure 23) are 

— = —b cos 0 and — = -a + bsinO, 

dt dt 


so 


dy _-a + bsmQ _- a + b (-y ^ x2 + r) 
dx -b cosB -b^x/yjx 2 +y 2 j 

_ fl V* 2 +y 2 + b V 

bx 


This equation is homogeneous, and its solution as found by the method 
of Section 7 is 


k +1 


c k (i/ + y fi 2 +y 2 ) = x l 
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FIGURE 23 


where k=a/b. It is clear that the fate of the boat depends on the relation 
between a and b. In Problem 9 we ask the reader to discover under what 
circumstances the boat will be able to land, and where. 


Problems 

1. In Example 1, show that the tension T at an arbitrary point ( x,y) on the 
chain is given by w 0 y. 

2. If the chain in Example 1 supports a load of horizontal density L(x), 
what differential equation should be used in place of (1)? 

3. What is the shape of a cable of negligible density [so that w(s) - 0] that 
supports a bridge of constant horizontal density given by L(x) = L 0 ? 

4. If the length of any small portion of an elastic cable of uniform density 
is proportional to the tension in it, show that it assumes the shape of a 
parabola when hanging under its own weight. 

5. A curtain is made by hanging thin rods from a cord of negligible den¬ 
sity. If the rods are close together and equally spaced horizontally, and 
if the bottom of the curtain is trimmed to be horizontal, what is the 
shape of the cord? 

6. What curve lying above the x-axis has the property that the length of 
the arc joining any two points on it is proportional to the area under 
that arc? 
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7. Show that the tractrix in Example 2 is orthogonal to the lower half of 
each circle with radius a and center on the positive y-axis. 

8. (a) In Example 3, assume that a < b (so that k < 1) and find y as a func¬ 

tion of x. How far does the rabbit run before the dog catches him? 

(b) Assume that a=b and find y as a function of x. How close does the 
dog come to the rabbit? 

9. In Example 4, solve the equation of the path for y and determine con¬ 
ditions on a and b that will allow the boat to reach the opposite bank. 
Where will it land? 


13 Simple Electric Circuits 

In the present section we consider the linear differential equations that gov¬ 
ern the flow of electricity in the simple circuit shown in Figure 24. This cir¬ 
cuit consists of four elements whose action can be understood quite easily 
without any special knowledge of electricity. 

A. A source of electromotive force (emf) £—perhaps a battery or gen¬ 
erator—which drives electric charge and produces a current I. 
Depending on the nature of the source, £ may be a constant or a 
function of time. 



FIGURE 24 














96 


Differential Equations with Applications and Historical Notes 


B. A resistor of resistance R, which opposes the current by producing a 
drop in emf of magnitude 

Er = RL 

This equation is called Ohm's law. 2 

C. An inductor of inductance L, which opposes any change in the cur¬ 
rent by producing a drop in emf of magnitude 



D. A capacitor (or condenser) of capacitance C, which stores the 
charge Q. The charge accumulated by the capacitor resists the inflow 
of additional charge, and the drop in emf arising in this way is 



Furthermore, since the current is the rate of flow of charge, and 
hence the rate at which charge builds up on the capacitor, we have 

I = d Q 

dt 

Students who are unfamiliar with electric circuits may find it helpful to 
think of the current I as analogous to the rate of flow of water in a pipe. 
The electromotive force E plays the role of a pump producing pressure (volt¬ 
age) that causes the water to flow. The resistance R is analogous to friction 
in the pipe, which opposes the flow by producing a drop in the pressure. 
The inductance L is a kind of inertia that opposes any change in the flow 
by producing a drop in pressure if the flow is increasing and an increase in 
pressure if the flow is decreasing. The best way to think of the capacitor is to 
visualize a cylindrical storage tank that the water enters through a hole in 
the bottom: the deeper the water is in the tank (Q), the harder it is to pump 
more water in; and the larger the base of the tank is (C) for a given quantity 


2 Georg Simon Ohm (1787-1854) was a German physicist whose only significant contribution 
to science was his discovery of the law stated above. When he announced it in 1827 it seemed 
too good to be true, and was not believed. Ohm was considered unreliable because of this, 
and was so badly treated that he resigned his professorship at Cologne and lived for several 
years in obscurity and poverty before it was recognized that he was right. One of his pupils 
in Cologne was Peter Dirichlet, who later became one of the most eminent German mathema¬ 
ticians of the nineteenth century. 
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of stored water, the shallower the water is in the tank and the easier it is to 
pump more water in. 

These circuit elements act together in accordance with Kirchhoffs law, 
which states that the algebraic sum of the electromotive forces around a 
closed circuit is zero. 3 This principle yields 

E — E R — E L — E c =0 


or 


JT -1 

E-R1-L — -—Q = 0, 
dt C 


which we rewrite in the form 


L^ + RI + ±Q = E. (1) 

dt L 

Depending on the circumstances, we may wish to regard either I or Q as the 
dependent variable. In the first case, we eliminate Q by differentiating (1) 
with respect to t and replacing dQ/dt by I: 


T d 2 I dl 1 T dE 

L —=- + R — + — I = —. 
dt~ dt C dt 


In the second case, we simply replace I by dQ/dt: 


t ^Q 

dt 2 


+ R^- + -Q = E. 
dt C 


( 2 ) 


( 3 ) 


We shall consider these second order linear equations in more detail later. 
Our concern in this section is primarily with the first order linear equation 

L — + RI=E (4) 

dt 

obtained from (1) when no capacitor is present. 


3 Gustav Robert Kirchhoff (1824-1887) was another German scientist whose work on electric 
circuits is familiar to every student of elementary physics. He also established the principles 
of spectrum analysis and paved the way for the applications of spectroscopy in determining 
the chemical constitution of the stars. 



98 


Differential Equations with Applications and Historical Notes 


Example 1. Solve equation (4) for the case in which an initial current I 0 
is flowing and a constant emf E 0 is impressed on the circuit at time t = 0. 
For t > 0, our equation is 


L — + R1 = E 0 . 
dt 

The variables can be separated, yielding 

-!"- = ! dt 
E 0 -RI L 

On integrating and using the initial condition I = I 0 when t = 0, we get 
log(E 0 -RI) = -*t + log(E 0 - RI 0 ), 




Note that the current I consists of a steady-state part E 0 /R and a transient 
part (I 0 - E 0 /R)e~ R,/L that approaches zero as t increases. Consequently, 
Ohm's law E 0 =RI is nearly true for large t. We also observe that if I 0 = 0, 
then 


I = —(l-e“ Rf/i ), 
R 


and if E 0 = 0, then I=I 0 e~ Rt,L . 


Problems 

1. In Example 1, with I 0 - 0 and E g * 0, show that the current in the circuit 
builds up to half its theoretical maximum in (L log 2 )/R seconds. 

2. Solve equation (4) for the case in which the circuit has an initial current 
I 0 and the emf impressed at time f = 0 is given by 

(a) E = E 0 e~ kt ; 

(b) E = E 0 sin cot. 
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3. Consider a circuit described by equation (4) and show that: 

(a) Ohm's law is satisfied whenever the current is at a maximum or 
minimum. 

(b) The emf is increasing when the current is at a minimum and 
decreasing when it is at a maximum. 

4. If L = 0 in equation (3), and if Q = 0 when t = 0, find the charge buildup 
Q = Q(t) on the capacitor in each of the following cases: 

(a) £ is a constant E 0 ; 

(b) E = E 0 e~‘; 

(c) E = E 0 cos cot. 

5. Use equation (1) with R = 0 and £ = 0 to find Q = Q(t ) and I=I(t) for the 
discharge of a capacitor through an inductor of inductance L, with ini¬ 
tial conditions Q = Q 0 and 1=0 when t = 0. 


Miscellaneous Problems for Chapter 2 

Among the following 50 differential equations are representatives of all 
the types discussed in this chapter, in random order. Many are solvable by 
several methods. They are presented for the use of students who wish to 
practice identifying the method or methods applicable to a given equation, 
without having the hint provided by the title of the section in which the 
equation occurs. 


i- yy"Hy'f- 

2. (1 -xy)y' = y 2 . 

3. (2x + 3y +1) dx + (2y - 3x + 5) dy = 0. 



5. y 2 dx = (x 3 - xy) dy. 

6. (x 2 y 3 + y)dx = (x 3 y 2 - x) dy. 

7. yy" + (y') 2 -2yy' = 0. 

8. x dy+y dx=x cos x dx. 

9. xy dy = x 2 dy + y 2 dx. 

10. (e x -3x 2 y 2 )y'+ye x = 2xy 3 . 

11. y" + 2x(y') 2 = 0. 

12. (x 2 + y) dx =x dy. 

13. xy' + y=x 2 cos x. 
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14. (6x + 4y + 3) dx + (3x + 2y + 2) dy = 0. 

15. cos (x + y) dx-x sin (x + y) dx + x sin (x + y) dy. 

16. x 2 y" + xy' = 1. 

17. (y 2 e* y + cos x) dx + (e*y+xye* y ) dy = 0. 


18. y' log (x - y) = 1 + log (x - y). 

19. y' + 2xy = e'* 2 . 

20. (y 2 - 3xy - 2x 2 ) dx = (x 2 - xy) dy. 

21. (l+x 2 )y' + 2xy = 4x 3 . 

22. e x sin y dx + e x cos y dy = y sin xy dx + x sin xy dy. 

23. (l+x 2 )y"+xy' = 0. 

24. (xe y + y - x 2 ) dy = (2xy - e y - x) dx. 

25. e*(l + x) dx- (xe x - ye y ) dy. 

26. (x 2 y 4 + x 6 ) dx - x 3 y 3 dy = 0. 

27. y' = 1 + 3y tan x. 



29. 


30. 


Ixye' 


{*lvr 


dy = _ 
dx y 2 + yy*/y) 2 + 2 xV* /y)2 

dy _ x + 2y + 2 


dx 


-2x + y 


31. 3x 2 logy dxH-dy = 0. 

32. —dx + f2ylog +3sinyldy = 0. 

x “I - 3x l x “I - 3 1 


33. 


y~* 


dx-- 


2x 


(x + y) 3 (x + y) 3 


dy = 0. 


34. (xy 2 + y) dx + xdy- 0. 

35. x 2 y" = y'(3x - 2y'). 

36. (3x 2 y - y 3 ) dx - (3xy 2 - x 3 ) dy = 0. 

37. x(x 2 + l)y' + 2y = (x 2 +1) 3 . 


38 _ dy -3x - 2y -1 
dx 2x + 3y -1 

39. e* 2y (l + 2x 2 y) dx + x 3 e x y dy = 0. 

40. (3x 2 e y - 2x) dx + (x 3 e y - sin y) dy = 0. 

41. y 2 y" + (y') 3 -0. 

42. (3xy + y 2 ) dx + (3xy + x 2 ) dy = 0. 
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43. x 2 y' = x 2 + xy + y 2 . 

44. xy' + y = y 2 logx. 



/ 


— dy = 0 

yj 


46. x 2 y" + (y') 2 = 0. 

47. {xy + y - 1) dx + x dy = 0. 

48. x 2 y' - y 2 = 2xy. 

49. y" = 2y(y') 3 . 
dX 

50. — + x cot y = sec y. 


dy 


51. A tank contains 50 gallons of brine in which 25 pounds of salt are 
dissolved. Beginning at time t-0, water runs into this tank at the 
rate of 2 gallons/minute, and the mixture flows out at the same rate 
through a second tank initially containing 50 gallons of pure water. 
When will the second tank contain the greatest amount of salt? 

52. A natural extension of the first order linear equation 


y’=p(x) + q(x)y 


is the Riccati equation 4 


y' = p(x) + y(x)y + r(x)y 2 . 


In general, this equation cannot be solved by elementary methods. 
However, if a particular solution y,(x) is known, then the general solu¬ 
tion has the form 


y(x)=yi(x) + z(x) 


4 Count Jacopo Francesco Riccati (1676-1754) was an Italian savant who wrote on mathematics, 
physics, and philosophy. He was chiefly responsible for introducing the ideas of Newton to 
Italy. At one point he was offered the presidency of the St. Petersburg Academy of Sciences, 
but understandably he preferred the leisure and comfort of his aristocratic life in Italy to 
administrative responsibilities in Russia. Though widely known in scientific circles of his 
time, he now survives only through the differential equation bearing his name. Even this 
was an accident of history, for Riccati merely discussed special cases of this equation without 
offering any solutions, and most of these special cases were successfully treated by various 
members of the Bernoulli family. The details of this complex story can be found in G. N. 
Watson, A Treatise on the Theory of Bessel Functions, 2d ed., pp. 1-3, Cambridge University 
Press, London, 1944. The special Riccati equation y' + by 2 = ex’" is known to be solvable in finite 
terms if and only if the exponent m is -2 or of the form -4k/(2A:+l) for some integer k (see 
problem 47-8). 
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where z(x) is the general solution of the Bernoulli equation 
z' - (q + 2ryfz = rz 2 . 

Prove this, and find the general solution of the equation 


y' = — + x 3 y 2 -x 5 , 
x 


which has yfx)-x as an obvious particular solution. 

53. The propagation of a single act in a large population (for example, buy¬ 
ing a Japanese- or German-made car) often depends partly on exter¬ 
nal circumstances (price, quality, and frequency-of-repair records) and 
partly on a human tendency to imitate other people who have already 
performed the same act. In this case the rate of increase of the propor¬ 
tion y(t) of people who have performed the act can be expressed by the 
formula 


%m{l-yMt) + lyl (*) 

dr 

where s(t) measures the external stimulus and I is a constant called the 

imitation coefficient. 5 

(a) Notice that (*) is a Riccati equation and that y = 1 is an obvious solu¬ 
tion, and use the result of Problem 52 to find the Bernoulli equation 
satisfied by z(f). 

(b) Find y(t) for the case in which the external stimulus increases 
steadily with time, so that s(t) - at for a positive constant a. Leave 
your answer in the form of an integral. 

54. (a) If Riccati's equation in Problem 52 has a known solution yfx), show 
that the general solution has the form of the one-parameter family 
of curves 

<f(x) + g(x) ' 

cF(x) + G(x) 

(b) Show, conversely, that the differential equation of any one-parame¬ 
ter family of this form is a Riccati equation. 


5 See Anatol Rapoport, "Contribution to the Mathematical Theory of Mass Behavior: I. The 
Propagation of Single Acts," Bulletin of Mathematical Biophysics, Vol. 14, pp. 159-169 (1952). 
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Dynamical problems with variable mass. In the preceding pages, we 
have considered many applications of Newton's second law of motion 
in the form given in Section 1: 

F = ma, 

where F is the force acting on a body of mass m whose acceleration 
is a. It should be realized, however, that this formulation applies only 
to situations in which the mass is constant. Newton's law is actually 
somewhat more general, and states that when a force F acts on a body 
of mass m, it produces momentum (mv, where v is the velocity) at a rate 
equal to the force: 


F = —(mv). 
dt V ' 


This equation reduces to F = ma when m is constant. In applying this 
form of the law to a moving body with variable mass, it is necessary 
to distinguish momentum produced by F from momentum produced 
by mass joining the body from an outside source. Thus, if mass with 
velocity v + zv (so that zv is its velocity relative to m) is being added to 
m at the rate dm/dt, the effect of F in increasing momentum must be 
supplemented by ( v + zv) dm/dt, giving 

V ' dt dt V 


which simplifies to 


w 


dm 

dt 


+ F = 


m 


dv 

dt 


We note that dm/dt is positive or negative according as the body is gain¬ 
ing or losing mass, and that zv is positive or negative depending on the 
motion of the mass gained or lost relative to m. The following problems 
provide several illustrations of these ideas. 

55. A rocket of structural mass m 1 contains fuel of initial mass m 2 . It is fired 
straight up from the surface of the earth by burning fuel at a constant 
rate a (so that dm/dt = -a where in is the variable total mass of the rocket) 
and expelling the exhaust products backward at a constant velocity 
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b relative to the rocket. Neglecting all external forces except a gravi¬ 
tational force mg, where g is assumed constant, find the velocity and 
height attained at the moment when the fuel is exhausted (the burnout 
velocity and burnout height). 6 

56. A spherical raindrop, starting from rest, falls under the influence of 
gravity. If it gathers in water vapor (assumed at rest) at a rate propor¬ 
tional to its surface, and if its initial radius is 0, show that it falls with 
constant acceleration g/ 4. 

57. If the initial radius of the raindrop in Problem 56 is r 0 and r is its radius 
at time t, show that its acceleration at time t is 


1 

4 


/ 

1 + 

v 



Thus the acceleration is constant—with value g/A —if and only if the 
raindrop has zero initial radius. 

58. A spherical raindrop, starting from rest, falls through a uniform mist. If 
it gathers in water droplets in its path (assumed at rest) as it moves, and 
if its initial radius is 0, show that it falls with constant acceleration g/7. 

59. Einstein's special theory of relativity asserts that the mass m of a par¬ 
ticle moving with velocity v is given by the formula 


m = 



o 


where c is the velocity of light and m 0 is the rest mass. 

(a) If the particle starts from rest in empty space and moves for a long 
time under the influence of a constant gravitational field, find v as a 
function of time by taking w = -v, and show that v -*■ c as t -* °°. 7 


6 The experience of engineering experts strongly suggests that no foreseeable combination of 
fuel and rocket design will enable a rocket, starting from rest, to acquire a burnout velocity as 
large as the escape velocity jlgR. This means that single-stage rockets of this kind cannot be 
used for journeys into space from the surface of the earth, and all such journeys will continue 
to require the multistage rockets familiar to us from recent decades. 

7 Enrico Fermi has suggested that the phenomenon described here, transferred to the case of 
charged particles of interstellar dust accelerated by the magnetic fields of stars, can account 
in part for the origin of primary cosmic rays. 
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(b) Let M = m - m 0 be the increase in the mass of the particle. If the cor¬ 
responding increase £ in its energy is taken to be the work done on 
it by the prevailing force F, so that 


E = 


\ 


F dx = 



dx = 



verify that 


£ -Me 1 . 


n 


(c) Deduce (*) from (**). 
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Chapter 3 

Second Order Linear Equations 


14 Introduction 

In the preceding chapters we studied a few restricted types of differential 
equations that can be solved in terms of familiar elementary functions. 
The methods we developed require considerable skill in the techniques of 
integration, and their many interesting applications have a tasty flavor of 
practicality. Unfortunately, however, it must be admitted that this part of the 
subject tends to be a miscellaneous bag of tricks, and conveys little insight 
into the general nature of differential equations and their solutions. In the 
present chapter we discuss an important class of equations with a rich and 
far-reaching theory. We shall see that this theory can be given a coherent and 
satisfying structure based on a few simple principles. 

The general second order linear differential equation is 

^\ + P(x)^ + Q(x)y = R(x), 
dx dx 


or, more simply. 


y"+P(x)y'+Q(x)y=R(x). (1) 

As the notation indicates, it is understood that P(x), Q(x), and R(x) are func¬ 
tions of x alone (or perhaps constants). It is clear that no loss of generality 
results from taking the coefficient of y" to be 1, since this can always be 
accomplished by division. Equations of this kind are of great significance 
in physics, especially in connection with vibrations in mechanics and the 
theory of electric circuits. In addition—as we shall see in later chapters— 
many profound and beautiful ideas in pure mathematics have grown out of 
the study of these equations. 

We should not be misled by the fact that first order linear equations are 
easily solved by means of formulas. In general, (1) cannot be solved explic¬ 
itly in terms of known elementary functions, or even in terms of indicated 
integrations. To find solutions, it is commonly necessary to resort to infinite 
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processes of one kind or another, usually infinite series. Many special equa¬ 
tions of particular importance in applications, for instance those of Legendre 
and Bessel mentioned in Section 1, have been studied at great length; and the 
theory of a single such equation has often been found so complicated as to 
constitute by itself an entire department of analysis. We shall discuss these 
matters in Chapters 5 and 8. 

In this chapter our detailed consideration of actual methods for solving 
(1) will be restricted, for the most part, to the special case in which the coef¬ 
ficients P(x) and Q(x) are constants. It should also be emphasized that most 
of the ideas and procedures we discuss can be generalized at once to linear 
equations of higher order, with no change in the underlying principles but 
only an increasing complexity in the surrounding details. By restricting our¬ 
selves for the most part to second order equations, we attain as much sim¬ 
plicity as possible without distorting the main ideas, and yet we still have 
enough generality to include all the linear equations of greatest interest in 
mathematics and physics. 

Since in general it is not possible to produce an explicit solution of (1) for 
inspection, our first order of business is to assure ourselves that this equa¬ 
tion really has a solution. The following existence and uniqueness theorem 
is proved in Chapter 13. 


Theorem A. Let P(x), Q(x), and R(x) he continuous functions on a closed interval 
[a,b]} Ifx 0 is any point in [a,b], and ify 0 and y' 0 are any numbers whatever, then equa¬ 
tion (1) has one and only one solution y(x) on the entire interval such that y(x 0 ) = y 0 
and y'(x 0 ) = y' 0 . 


Thus, under these hypotheses, at any given point x {) in [a,b] we can arbitrarily 
prescribe the values of y(x) and y'(x), and there will then exist precisely one 
solution of (1) on [a,b\ that assumes the prescribed values at the given point; 
or, more geometrically, (1) has a unique solution on [a,b] that passes through 
a specified point (x 0 ,y 0 ) with a specified slope y' 0 . In our general discussions 
through the remainder of this chapter, we shall always assume (without nec¬ 
essarily saying so explicitly) that the hypotheses of Theorem A are satisfied. 

Example 1. Find the solution of the initial value problem 

y"+y = 0, y(0) and y'(0) = 1. 

We know that y = sin x, y =cos x, and more generally y = sin x + c 2 cos x 
for any constants q and c 2 , are all solutions of the differential equation. 


1 If a and b are real numbers such that a < b, then the symbol [a,b] denotes the interval consist¬ 
ing of all real numbers x that satisfy the inequalities a < x <b. This interval is called closed 
because it contains its endpoints. The open interval resulting from the exclusion of the end¬ 
points is denoted by ( a,b ) and is defined by the inequalities a <x <b. 
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Also, y = sin x clearly satisfies the initial conditions, because sin 0 = 0 and 
cos 0 = 1. By Theorem A, y = sin x is the only solution of the given initial 
value problem, and is therefore completely characterized as a function 
by this problem. In just the same way, the function y = cos x is easily seen 
to be a solution, and therefore the only solution, of the corresponding 
initial value problem 


V"+y= 0, J/(0) = 1 , and y'(0) = 0. 

Since all of trigonometry can be regarded as the development of the 
properties of these two functions, it follows that all of trigonometry is 
contained by implication (as the acorn contains the oak tree) within the 
two initial value problems stated above. We shall examine this remark¬ 
able idea in greater detail in Chapter 4. 


We emphasize again that in Theorem A the initial conditions that determine 
a unique solution of equation (1) are conditions on the value of the solution 
and its first derivative at a single fixed point x 0 in the interval [a,b\. In contrast 
to this, the problem of finding a solution of equation (1) that satisfies condi¬ 
tions of the form y(x 0 ) =y 0 and y(x 1 )-y 1 , where x 0 and x, are different points 
in the interval, is not covered by Theorem A. Problems of this kind are called 
boundary value problems, and are discussed in Chapter 7. 

The term R(x) in equation (1) is isolated from the others and written on the 
right because it does not contain the dependent variable y or any of its deriv¬ 
atives. If R(x) is identically zero, then (1) reduces to the homogeneous equation 

y" + P(x)y' + Q(x)y = 0. (2) 

(This traditional use of the word homogeneous should not be confused with 
the equally traditional but totally different use given in Section 7.) If R(x) is 
not identically zero, then (1) is said to be nonhomogeneous. 

In studying the nonhomogeneous equation (1) it is necessary to consider 
along with it the homogeneous equation (2) obtained from it by replacing 
R(x) by 0. Under these circumstances (1) is often called the complete equation, 
and (2) the reduced equation associated with it. The reason for this linkage 
between (1) and (2) is easy to understand, as follows. 

Suppose that in some way we know that y v (x,c ] ,c 2 ) is the general solution of 
(2)—we expect it to contain two arbitrary constants since the equation is of 
the second order—and that y p (x) is a fixed particular solution of (1). If y(x) is 
any solution whatever of (1), then an easy calculation shows that y(x) - y p (x) 
is a solution of (2): 

(y - y v )" + p M(y - y?)'+Q(*)(y - y?) 

=[y"+ P(*)y' +QMy] - [y" + P(x)y' F + Q(*)y P ] 

-R(x)-R(x) = 0. 


( 3 ) 
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Since y g (x,c v cf) is the general solution of (2), it follows that y(x) - y p (x) = 
y y (x,c v c ’fj or 


y(x)=y g (x,c 1 ,c 2 )+y p (x) 

for a suitable choice of the constants c 1 and c 2 . This argument proves the fol¬ 
lowing theorem. 

Theorem B. If y g is the general solution of the reduced equation (2) and y p is any 
particular solution of the complete equation (1), then y g +y v is the general solution 
o/(l). 

We shall see in Section 19 that if y g is known, then a formal procedure is 
available for finding y p . This shows that the central problem in the theory of 
linear equations is that of solving the homogeneous equation. Accordingly, 
most of our attention will be devoted to studying the structure of y g and 
investigating various ways of determining its explicit form—none of which 
is effective in all cases. 

The first thing we should notice about the homogeneous equation (2) is 
that the function y(x) which is identically zero—that is, y(x) = 0 for all x —is 
always a solution. This is called the trivial solution, and is usually of no inter¬ 
est. The basic structural fact about solutions of (2) is given in the following 
theorem. 


Theorem C.Ifyfx) and y 2 (x) are any two solutions of( 2), then 

cyyfx) + c 2 y 2 (x) (4) 

is also a solution for any constants q and 02 - 

Proof. The statement follows immediately from the fact that 

(cij/i + c 2 y 2 )" + P(x)( c iyi + C 2 J/ 2 )' + Q(x)(c 1 y 1 + C 2 J/ 2 ) 

= (cry!+£23/2)+ P( x )( c iy'i + c 2 y'i) +QM( c i 3 /i + ^23/2) 

= Ci[y”i + P(x)y[ + Q(x)i/i] + c 2 [y" 2 + P(x)y 2 + Q(x)t/ 2 ] 

= Ci- 0 + c 2 -0 = 0, (5) 

where the multipliers of q and c 2 are zero because, by assumption, y 1 and y 2 
are solutions of (2). 

For reasons connected with the elementary algebra of vectors, the solution (4) 
is commonly called a linear combination of the solutions yfx) and y 2 (x). If we 
use this terminology. Theorem C can be restated as follows: any linear combi¬ 
nation of two solutions of the homogeneous equation (2) is also a solution. 
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Suppose that by some means or other we have managed to find two solu¬ 
tions of equation (2). Then this theorem provides us with another which 
involves two arbitrary constants, and which therefore may be the general 
solution of (2). There is one difficulty: if either y 1 or y 2 is a constant multiple 
of the other, say y 1 =ky 2 , then 

c$i + C $2 = c x ky 2 + c$ 2 =(cjfc+ c 2 )y 2 =cy 2 , 

and only one essential constant is present. On this basis we have reasonable 
grounds for hoping that if neither y 1 nor y 2 is a constant multiple of the other, 
then 


c$i{x) + c$ 2 (x) 

will be the general solution of (2). We shall prove this in the next section. 

Occasionally the special form of a linear equation enables us to find simple 
particular solutions by inspection or by experimenting with power, expo¬ 
nential, or trigonometric functions. 


Example 2. Solve 


y"+y'= o. 

By inspection we see that i/i = l and y 2 =e~ x are solutions. It is obvious 
that neither function is a constant multiple of the other, so (assuming the 
theorem stated above, but not yet proved) we conclude that 

y = Cj + c 2 e~ x 


is the general solution. 


Example 3. Solve 


x 2 y" + 2xy' - 2y = 0. 

Since differentiating a power pushes down the exponent by one unit, 
the form of this equation suggests that we look for possible solutions 
of the type y = x". On substituting this in the differential equation and 
dividing by the common factor x", we obtain the quadratic equation 
n{n - l) + 2« - 2 = 0 or n 2 + n - 2 = 0. This has roots n = 1, -2, so y t =x and 
y, = xr 2 are solutions and 


y=c l x+c 2 xr 2 

is the general solution on any interval not containing the origin. 

It is worth remarking at this point that a large part of the theory 
of linear equations rests on the fundamental properties stated in 


112 


Differential Equations with Applications and Historical Notes 


Theorems B and C. An inspection of the calculations (3) and (5) will 
show at once that these properties in turn depend on the linearity of dif¬ 
ferentiation, that is, on the fact that 

[«/ (x) + PgMl' = a f(x) + Pg'(x) 

for all constants a and p and all differentiable functions/(x) and g(pc). 


Problems 

In the following problems, assume the fact stated above (but not yet proved), 
that if yfx) and y 2 (x) are two solutions of (2) and neither is a constant multiple 
of the other, then cyjfx) + c z y 2 (x) is the general solution. 

1. (a) Verify that y 2 = l and y 2 =x * 1 2 are solutions of the reduced equation 

xy" -y' = 0, and write down the general solution. 

(b) Determine the value of a for which y p -ax 3 4 5 is a particular solution 
of the complete equation xy"-y'-3x 2 . Use this solution and the 
result of part (a) to write down the general solution of this equation. 
(Compare with Example 1 in Section 11.) 

(c) Can you discover y v y 2 , and y p by inspection? 

2. Verify that yj = l and y 2 =log x are solutions of the equation xy" + if = 0, 
and write down the general solution. Can you discover y 2 and y 2 by 
inspection? 

3. (a) Show that y, = er x and y 2 - e 2x are solutions of the reduced equation 

y" —y' - 2y = 0. What is the general solution? 

(b) Find a and b so that y p - ax + b is a particular solution of the complete 
equation y" —y' — 2y = Ax. Use this solution and the result of part (a) 
to write down the general solution of this equation. 

4. Use inspection or experiment to find a particular solution for each of 
the following equations: 

(a) x 3 y" + x 2 y' + xy =1; 

(b) y" - 2y' = 6; 

(c) y" _ 2y = sin x. 

5. In each of the following cases, use inspection or experiment to find 
particular solutions of the reduced and complete equations and write 
down the general solution: 

(a) y"=e x ; 

(b) y"-2y' = 4; 
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(c) y"~y = sinx; 

(d) (x-l)y"-xy' + y = 0; 

(e) y"+2y'=6e x . 

6. By eliminating the constants c x and c 2 , find the differential equation of 
each of the following families of curves: 

(a) y=c 1 x+c 2 x 2 ; 

(b) y = c r e kx + c 2 e~ fa ; 

(c) y=c 1 sin kx + c 2 cos fcx; 

(d) y=c l + c 2 e~ 2x } 

(e) y-c r x + c 2 sinx; 

(f) y=c 1 e x +c 2 xe Xm , 

(g) y=c l e x +c 2 e~ 3Xm , 

(h) y^CjX + CjX -1 . 

7. Verify that y = CjX -1 + c 2 x 5 is a solution of 

x 2 y" - 3xy - 5y = 0 

on any interval [a,b] that does not contain the origin. If x 0 * 0, and if y 0 
and y' 0 are arbitrary, show directly that c 1 and c 2 can be chosen in one 
and only one way so that y(x 0 ) = y 0 and y\x 0 ) = y' 0 . 

8. Show that y = x 2 sin x and y = 0 are both solutions of 

xhj" -4xy' + (x 2 + 6)y = 0, 

and that both satisfy the conditions y(0) = 0 and y'(0) = 0. Does this con¬ 
tradict Theorem A? If not, why not? 

9. If a solution of equation (2) on an interval [a,b\ is tangent to the x-axis at 
any point of this interval, then it must be identically zero. Why? 

10. If y 2 (x) and y 2 (x) are two solutions of equation (2) on an interval [a,b], 
and have a common zero in this interval, show that one is a constant 
multiple of the other. [Recall that a point x 0 is said to be a zero of a func¬ 
tion/(x) if/(x) = 0.] 


15 The General Solution of the Homogeneous Equation 

If two functions /(x) and g(x) are defined on an interval [a,b] and have the 
property that one is a constant multiple of the other, then they are said to 
be linearly dependent on [ a,b ]. Otherwise—that is, if neither is a constant mul¬ 
tiple of the other—they are called linearly independent. It is worth noting that 
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if f(x) is identically zero, then f(x) and g(x) are linearly dependent for every 
function g(x), since/(x) = 0 • g(x). 

Our purpose in this section is to prove the following theorem. 


Theorem A. Let yfx) and y 2 (x) he linearly independent solutions of the homoge¬ 
neous equation 


y" + P(x)y' + Q(x)y = 0 (1) 

on the interval [a,b\. Then 

cpjfx) + c 2 y 2 (x) (2) 

is the general solution of equation (1) on [a,b\, in the sense that every solution of( 1) 
on this interval can be obtained from (2) by a suitable choice of the arbitrary constants 
Cj and c 2 . 

The proof will be given in stages, by means of several lemmas and auxiliary 
ideas. 

Let y(x) be any solution of (1) on [a,b\. We must show that constants c, and 
c 2 can be found so that 


y( x ) = c i!/i( x ) + c 2 y 2 {x) 

for all x in [a,b\. By Theorem 14-A, a solution of (1) over all of [a,b] is com¬ 
pletely determined by its value and the value of its derivative at a single 
point. Consequently, since c^yfx) + cgjfx) and y(x) are both solutions of (1) on 
[a,b], it suffices to show that for some point x 0 in [a,b\ we can find c, and c 2 so 
that 


Cl/iC^o)+ c 2 y 2 {xf)—y(x 0 ) 


and 


c 1 y’ 1 (x 0 ) + c 2 y 2 (x 0 ) = y’(x 0 ). 


For this system to be solvable for c 1 and c 2 , it suffices that the determinant 

yi(*o) Mxo) 

,, , ,, =V\{xf)y 2 {xf)-y 2 (xf)yfxf) 

y i(*o) yi(x 0 ) 


have a value different from zero. This leads us to investigate the function of 
x defined by 


W(yi,y 2 ) = yiy2-y2yi, 
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which is known as the Wronskian 2 of t/, and y 2 , with special reference to 
whether it vanishes at x 0 . Our first lemma simplifies this problem by show¬ 
ing that the location of the point x 0 is of no consequence. 

Lemma 1. Ifyfx) and y 2 (x) are any two solutions of equation (1) on [a,b\, then their 
Wronskian W = W(y v yf) is either identically zero or never zero on [a,b\. 

Proof. We begin by observing that 

W' = 1 / 11/2 + y'iy '2 - y 2 y'[ - y'iy'i 
= V\Vz-V2Vu 

Next, since y 1 and y 2 are both solutions of (1), we have 


y”i + Py\+Qyi - 0 


and 


yi + Py'i+Qyi - 0 . 


On multiplying the first of these equations by y 2 and the second by y v and 
subtracting the first from the second, we obtain 


( 3 / 13/2 - yzy") + Piyiy'i - yzy'f) = 0 


or 


dW 

dx 


+ PW = 0. 


The general solution of this first order equation is 


W = ce'l Pdx -, (3) 

and since the exponential factor is never zero we see that W is identically 
zero if the constant c - 0 , and never zero if c * 0 , and the proof is complete . 3 

This result reduces our overall task of proving the theorem to that of 
showing that the Wronskian of any two linearly independent solutions of (1) 


2 Hoene Wronski (1778-1853) was an impecunious Pole of erratic personality who spent most 
of his life in France. The Wronskian determinant mentioned above was his sole contribu¬ 
tion to mathematics. He was the only Polish mathematician of the nineteenth century whose 
name is remembered today, which is a little surprising in view of the many eminent men in 
this field whom Poland has given to the twentieth century. 

3 Formula (3) is due to the great Norwegian mathematician Niels Henrik Abel (see Appendix B 
in Chapter 9), and is called Abel's formula. 
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is not identically zero. We accomplish this in our next lemma, which actually 
yields a bit more than is needed. 


Lemma 2. lfyfx) and y 2 (x) are tzvo solutions of equation (1) on [a,b], then they are lin¬ 
early dependent on this interval if and only if their Wronskian W(y 1 ,y 2 ) = yiy 2 - y 2 y[ 
is identically zero. 

Proof. We begin by assuming that y 1 and t/ 2 are linearly dependent, and we 
show as a consequence of this that yp)' 2 - J/ 2 J /1 = 0. First, if either function 
is identically zero on [a,b], then the conclusion is clear. We may therefore 
assume, without loss of generality, that neither is identically zero; and it fol¬ 
lows from this and their linear dependence that each is a constant multiple of 
the other. Accordingly we have y 2 = cy x for some constant c, so y' 2 = cy\. These 
equations enable us to write 

yxy' 2 -yiy\ = yfcy\)-(cyf)y\ 

-0, 

which proves this half of the lemma. 

We now assume that the Wronskian is identically zero and prove linear 
dependence. If y x is identically zero on [a,b\, then (as we remarked at the 
beginning of the section) the functions are linearly dependent. We may there¬ 
fore assume that i/, does not vanish identically on [a,b\, from which it follows 
by continuity that y 1 does not vanish at all on some subinterval [c,d\ of [a,b]. 
Since the Wronskian is identically zero on [a,b], we can divide it by y\ to get 

yiy'i-yiy'i _ n 
2 u 

yi 

on [c,d]. This can be written in the form (y 2 /yf = 0, and by integrating we 
obtain y 2 /y } -k or y 2 (x)-kyfx) for some constant k and ah x in [c,d\. Finally, 
since y 2 (x) and kyfx) have equal values in [c,d\, they have equal derivatives 
there as well; and Theorem 14-A allows us to infer that 

y 2 (x)=ky 1 (x) 

for all x in [a,b], which concludes the argument. 

With this lemma, the proof of Theorem A is complete. 


Ordinarily, the simplest way of showing that two solutions of (1) are lin¬ 
early independent over an interval is to show that their ratio is not constant 
there, and in most cases this is easily determined by inspection. On occasion, 
however, it is convenient to employ the formal test embodied in Lemma 2: 
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compute the Wronskian, and show that it does not vanish. Both procedures 
are illustrated in the following example. 


Example 1. Show that y=c 1 sin x + c 2 cos x is the general solution of 
y" + y = 0 on any interval, and find the particular solution for which 
y(0) = 2 and y'(0) = 3. 

The fact that y 2 = sin x and y, = cos x are solutions is easily verified by 
substitution. Their linear independence on any interval [a,b] follows 
from the observation that yj/y 2 = tan x is not constant, and also from the 
fact that their Wronskian never vanishes: 


W(yi,r/ 2 ) 


sinx 

cosx 


cosx 

-sinx 


= -sin * 1 2 x-cos 2 x = -l. 


Since P(x ) = 0 and Q(x) = 1 are continuous on [a,b], it now follows from 
Theorem A that y=Cj sin x + c 2 cos x is the general solution of the given 
equation on [a,b]. Furthermore, since the interval [a,b] can be expanded 
indefinitely without introducing points at which P(x) or Q(x) is discon¬ 
tinuous, this general solution is valid for all x. To find the required par¬ 
ticular solution, we solve the system 

c 1 sin 0 + c 2 cos 0 = 2 , 

q cos 0 - c 2 sin 0 = 3. 

This yields c 2 = 2 and c x = 3, so y = 3 sin x + 2 cos x is the particular solution 
that satisfies the given conditions. 


The concepts of linear dependence and independence are significant in a 
much wider context than appears here. As the reader is perhaps already 
aware, the important branch of mathematics known as linear algebra is in 
essence little more than an abstract treatment of these concepts, with many 
applications to algebra, geometry, and analysis. 


Problems 

In Problems 1 to 7, use Wronskians to establish linear independence. 

1. Show that e x and e~ x are linearly independent solutions of y" -y = 0 on 
any interval. 

2 . Show that y = c 1 x + c 2 x 2 is the general solution of 

x 2 y" -2xy'+2y = Q 

on any interval not containing 0 , and find the particular solution for 
which y(l) = 3 and y'( 1) = 5. 
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3. Show that y = c x e x + c 2 e 2x is the general solution of 

y"-3y' + 2y = 0 

on any interval, and find the particular solution for which y( 0 ) = -1 and 

y\ o) = i. 

4. Show that y = c r e 2x + c 2 xe 2x is the general solution of 

y" - 4y'+4y = 0 


on any interval. 

5. By inspection or experiment, find two linearly independent solutions 
of x 2 y" - 2 y = 0 on the interval [ 1 , 2 ], and determine the particular solu¬ 
tion satisfying the initial conditions y(l) = 1 , y'(l) = 8. 

6 . In each of the following, verify that the functions yfx) and y 2 (x) are 
linearly independent solutions of the given differential equation on 
the interval [ 0 , 2 ], and find the solution satisfying the stated initial 
conditions: 

(a) y" + y'- 2 y = 0 , y 1 = e x and y 2 = e~ 2x , y( 0 ) = 8 and y'( 0 ) = 2 ; 

(b) y" + y'- 2 y = 0 , y 1 =e x andy 2 = e~ 2x , y(l) = 0 andy'(l) = 0 ; 

(c) y" + 5 y' + 6 y = 0, y x = e~ 2x and y 2 = e~ 3x , y(0) = 1 and y'(0) = V 

(d) y" + y' = 0 , yi = 1 and y 2 = e~ x , y(2) = 0 and y'( 2 ) = e~ 2 . 

7. (a) Use one (or both) of the methods described in Section 11 to find all 

solutions of y" + (y ') 2 = 0 . 

(b) Verify that y x = 1 and y 2 = log x are linearly independent solutions of 
the equation in part (a) on any interval to the right of the origin. Is 
y-c r + c 2 log x the general solution? If not, why not? 

8 . Use the Wronskian to prove that two solutions of the homogeneous 
equation ( 1 ) on an interval [a,b\ are linearly dependent if 

(a) they have a common zero x 0 in the interval (Problem 14-10); 

(b) they have maxima or minima at the same point x 0 in the interval. 

9. Consider the two functions /(x) = x 3 and g(x)= x 2 |x| on the interval 

[- 1 , 1 ]. 

(a) Show that their Wronskian W(f, g) vanishes identically. 

(b) Show that/and g are not linearly dependent. 

(c) Do (a) and (b) contradict Lemma 2? If not, why not? 

10. It is clear that sin x, cos x and sin x, sin x - cos x are two distinct pairs of 
linearly independent solutions of y" + y = 0. Thus, if y, and y 2 are linearly 
independent solutions of the homogeneous equation 
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y" + P(x)y' + Q(x)y= 0, 


we see that y t and y 2 are not uniquely determined by the equation, 

(a) Show that 


W(yi,y 2 ) 


and 



so that the equation is uniquely determined by any given pair of 
linearly independent solutions. 

(b) Use (a) to reconstruct the equation y" + y = 0 from each of the two 
pairs of linearly independent solutions mentioned above. 

(c) Use (a) to reconstruct the equation in Problem 4 from the pair of 
linearly independent solutions e 2x , xe 2x . 

11. (a) Show that by applying the substitution y = uv to the homogeneous 
equation (1) it is possible to obtain a homogeneous second order lin¬ 
ear equation for v with no v’ term present. Find u and the equation 
for v in terms of the original coefficients P(x) and Q(x). 

(b) Use the method of part (a) to find the general solution of 
y" + 2xy' + (1 + x 2 )y = 0. 


16 The Use of a Known Solution to find Another 

As we have seen, it is easy to write down the general solution of the homo¬ 
geneous equation 


y" + P(x)y' + Q(x)y = 0 


( 1 ) 


whenever we know two linearly independent solutions y x (x) and y 2 (x). But 
how do we find y, and y 2 ? Unfortunately there is no general method for 
doing this. However, there does exist a standard procedure for determin¬ 
ing y 2 when y, is known. This is of considerable importance, for in many 
cases a single solution of (1) can be found by inspection or some other 
device. 

To develop this procedure, we assume that y x (x) is a known nonzero solu¬ 
tion of (1), so that cy t (x) is also a solution for any constant c. The basic idea is 
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to replace the constant c by an unknown function v{x), and then to attempt to 
determine v in such a manner that y 2 = vy ] will be a solution of (1). It isn't at 
all clear in advance that this approach will work, but it does. To see how we 
might think of trying it, recall that the linear independence of the two solu¬ 
tions i p and y 2 requires that the ratio y 2 /y x must be a nonconstant function 
of x, say v, and if we can find v, then since we know ty, we have y 2 and our 
problem is solved. 

We assume, then, that y 2 =vy l is a solution of (1), so that 


y 2 + Py 2 + Qy 2 - 0, (2) 

and we try to discover the unknown function v(x). On substituting y 2 - vy , 
and the expressions 

y’i = v y\ + v'xji and y” 2 = vy\ + 2 v’y\ + v”y x 
into (2) and rearranging, we get 

v{y{ + Py[ + Qyi) + v”y x + v\2 y\ + Pi/ X ) = 0. 

Since y x is a solution of (1), this reduces to 

zA/i + v'(2y[ + Py x ) = 0 



v' yi 


An integration now gives 


so 


log v' = -2 log ij] - j ’pdx, 


v = 


\Pdx 


y i 


and 


v = 



( 3 ) 
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All that remains is to show that y 1 and y 2 = vi) u where v is given by (3), actu¬ 
ally are linearly independent as claimed; and this we leave to the reader in 
Problem l. 4 

Example 1. y x =x is a solution of x 2 y" +xy' - y =0 which is simple enough 
to be discovered by inspection. Find the general solution. 

We begin by writing the given equation in the form of (1): 



Since P(x) = l/x, a second linearly independent solution is given by 
y,= vy v where 



This yields y 2 = (-l/2)x _1 , so the general solution is y=c 1 x+c 1 x~ l . 


Problems 


1. If y x is a nonzero solution of equation (1) and ty 2 = vy l where v is given by 
formula (3), is the second solution found in the text, show by computing 
the Wronskian that y x and y 2 are linearly independent. 

2. Use the method of this section to find y 2 and the general solution of 
each of the following equations from the given solution yy 

(a) y" + y = 0, y x = sin x; 

(b) y” -y=0,y 1 =e x . 

3. The equation xy" + 3ty' = 0 has the obvious solution t/, = 1. Find y 2 and the 
general solution. 

4. Verify that y x = x 2 is one solution of xhj" +xy' - 4y = 0, and find y 2 and the 
general solution. 

5. The equation (1 - x 2 )y" - 2 xy' + 2y = 0 is the special case of Legendre's 
equation 


(1 - x 2 )iy" - 2xy' + p{p + l)y=0 


corresponding to p - 1. It has y x - x as an obvious solution. Find the gen¬ 
eral solution. 


4 Formula (3) is due to the eminent French mathematician Joseph Liouville (see the note at the 
end of Section 43). 
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6. The equation x 2 y" + xy' + (x 2 
equation 



x 2 y"+ xy' + (x 2 - p 2 )y = 0 

corresponding to p= . Verify that y, = x _1/2 sin x is one solution over 


any interval including only positive values of x, and find the general 
solution. 

7. Use the fact that y x = x is an obvious solution of each of the following 
equations to find their general solutions: 


(b) x 2 y" + Ixy' - 2y = 0; 

(c) x 2 y" - x(x + 2)y' + (x + 2)y = 0. 

8. Find the general solution of y" - x/(x)y' +/(x)y = 0. 

9. Verify that one solution of xy" - (2x + l)y' + (x + l)y = 0 is given by y, = e x , 
and find the general solution. 

10. (a) If n is a positive integer, find two linearly independent solutions of 


xy" - (x + n)y' + ny = 0. 


(b) Find the general solution of the equation in part (a) for the cases 
n = l, 2, 3. 

11. Find the general solution of y" -/(x)y' + \f(x) - l]y = 0. 

12. For another, faster approach to formula (3), show that 
v' = {yz/yfi' = W(yi,y 2 )/yi and use Abel's formula in Section 15 to 
obtain v. 


17 The Homogeneous Equation with Constant Coefficients 

We are now in a position to give a complete discussion of the homogeneous 
equation y" + P(x)y' + Q(x)y = 0 for the special case in which P(x) and Q(x) are 
constants p and q: 


( 1 ) 


y"+py' + qy = 0. 


Our starting point is the fact that the exponential function e mx has the prop¬ 
erty that its derivatives are all constant multiples of the function itself. This 
leads us to consider 


( 2 ) 
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as a possible solution for (1) if the constant m is suitably chosen. Since y' = me"' 1 
and y" = m 2 e mx , substitution in (1) yields 

(m 2 + pm + q)e mx = 0; (3) 

and since e mx is never zero, (3) holds if and only if m satisfies the auxiliary 
equation 


m 2 + pm + q = 0. (4) 

The two roots m l and m 2 of this equation, that is, the values of m for which (2) 
is a solution of (1), are given by the quadratic formula: 


m lf m 2 


~P±yjp 2 - 4 ? 
2 


( 5 ) 


Further development of this situation requires separate treatment of the 
three possibilities inherent in (5). 


Distinct real roots. 5 It is clear that the roots m 1 and m 2 are distinct real num¬ 
bers if and only if p 2 - Aq > 0. In this case we get the two solutions 

e mx and e mx . 


Since the ratio 


p m i* 

c __ 

e m2X 


is not constant, these solutions are linearly independent and 

y = c x e mx + c 2 e mx (6) 


is the general solution of (1). 


Distinct complex roots. The roots m l and m 2 are distinct complex numbers if 
and only if p 2 -Aq< 0. In this case m t and m 2 can be written in the form a ± ib; 
and by Euler's formula 


e ;e_ cos 0 + f s in 0 


( 7 ) 


5 We take it for granted that the reader is acquainted with the elementary algebra of complex 
numbers. Euler's formula (7) is—or ought to be—a standard part of any reasonably satisfac¬ 
tory course in calculus. 
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our two solutions of (1) are 

e mx = e {a+ib)x = e ax e ibx = e“(cos bx + i sin bx) (8) 

and 

e mix = e {a -' b)x = e ax e- ibx = e“(cos bx - i sin bx). (9) 

Since we are interested only in solutions that are real-valued functions, we 
can add (8) and (9) and divide by 2, and subtract and divide by 2 i, to obtain 

e ax cos bx and e ax sin bx. (10) 

These solutions are linearly independent, so the general solution of (1) in this 
case is 


y=e ax (c 1 cos bx + c 2 sin bx). (11) 

We can look at this matter from another point of view. A complex-valued 
function w(x) = u(x) + iv(x) satisfies equation (1), in which p and q are real num¬ 
bers, if and only if u(x) and v(x) satisfy (1) separately. Accordingly, a complex 
solution of (1) always contains two real solutions, and (8) yields the two solu¬ 
tions (10) at once. 


Equal real roots. It is evident that the roots m 1 and m 2 are equal real num¬ 
bers if and only if p 2 - Aq = 0. Here we obtain only one solution y = e mx with 
m=-p/2. However, we can easily find a second linearly independent solution 
by the method of the preceding section: if we take y 1 -e ( ~f /2 l x , then 

v= [\e ^ dx dx = f—— e~ px dx = x 
J y[ J e p 

and y 2 = vy 1 = xe mx . In this case (1) has 


y=c 1 e mx +c 2 xe mx 


( 12 ) 


as its general solution. 

In summary, we have three possible forms—given by formulas (6), (11), 
and (12)—for the general solution of the homogeneous equation (1) with 
constant coefficients, depending on the nature of the roots m l and m 2 of 
the auxiliary equation (4). It is clear that the qualitative nature of this gen¬ 
eral solution is fully determined by the signs and relative magnitudes of 
the coefficients p and q, and can be radically changed by altering their 
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numerical values. This matter is important for physicists concerned with 
the detailed analysis of mechanical systems or electric circuits described by 
equations of the form (1). For instance, if p 2 < 4 q, the graph of the solution 
is a wave whose amplitude increases or decreases exponentially according 
as p is negative or positive. This statement and others like it are obvious 
consequences of the above discussion, and are given exhaustive treatment 
in books dealing more fully with the elementary physical applications of 
differential equations. 

The ideas of this section are primarily due to Euler. A brief sketch of a 
few of the many achievements of this great scientific genius is given in 
Appendix A. 


Problems 

1. Find the general solution of each of the following equations: 


(a) 

y" 

+ y'-6y=0; 

(b) 

y" 

+ 2y'+y=0; 

(c) 

y" 

+ 

00 

II 

O 

(d) 

2 y 

" -4y' + 8y = 0; 

(e) 

y" 

- 4 y'+4y = 0; 

(f) 

y" 

-9y' + 20y = 0; 

(g) 

2 y 

o 

II 

CO 

+ 

+ 

(h) 

4 y 

" - 12y' + 9y = 0; 

(i) 

y" 

+y'= 0 ; 

(j) 

y" 

- 6y' + 25y = 0; 

(k) 

4 y 

" + 20y' + 25y = 0; 

(1) 

y" 

+ 2y + 3 y = 0; 

(m) 

y” 

= 4 y; 

(n) 

4 y 

" -8y' + 7y = 0; 

(o) 

2 y 

+ 

<< 

1 

II 

o 

(P) 

16y" - 8y'+y = 0; 

(q) 

y" 

+ 4y' + 5y = 0; 

(r) 

y" 

+4y' - 5y = 0. 


2. Find the solutions of the following initial value problems: 

(a) y" - 5y' + 6y = 0, y(l) = e 2 and y'(l) = 3e 2 ; 

(b) y" - 6y' + 5y = 0, y(0) = 3 and y'(0) = 11; 
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(c) y" - 6 y' + 9y = 0, y{ 0) = 0 and y'( 0) = 5; 

(d) y" + Ay' + 5y = 0, y( 0) = 1 and y'(0) = 0; 

(e) y"+4y' + 2y = 0, y(0) = -l andy'(0) = 2 + 3V2; 

(f) y" + 8y' - 9y=0, y(l) = 2 and y'(l) = 0. 

3. Show that the general solution of equation (1) approaches 0 as x -*■ °° if 
and only if p and q are both positive. 

4. Without using the formulas obtained in this section, show that the 
derivative of any solution of equation (1) is also a solution. 

5. The equation 

xY + pxy’+qy= 0 , 

where p and q are constants, is called Euler's equidimensional equa¬ 
tion. 6 Show that the change of independent variable given by x-e z 
transforms it into an equation with constant coefficients, and apply 
this technique to find the general solution of each of the following 
equations: 

(a) x 2 y" + 3xy'+ lOy = 0; 

(b) 2x 2 y" + 10xy'+ 8y = 0; 

(c) x 2 y" + Ixy' - Ely = 0; 

(d) 4x 2 y" - 3y = 0; 

(e) x 2 y" - 3 xy' + 4y = 0; 

(f) x 2 y" + Ixy' - 6y = 0; 

(g) x 2 y" + 2xy' + 3y = 0; 

(h) x 2 y"+xy' - 2y = 0; 

(i) x 2 y"+xy'-16y = 0. 

6. In Problem 5 certain homogeneous equations with variable coefficients 
were transformed into equations with constant coefficients by chang¬ 
ing the independent variable from xtoz = log x. Consider the general 
homogeneous equation 

y" + P(x)y' + Q(x)y = 0, (*) 

and change the independent variable from x to z = z(x), where z(x) is an 
unspecified function of x. Show that equation (*) can be transformed 
in this way into an equation with constant coefficients if and only if 

(Q' +2PQ)/Q 3/2 is constant, in which case z = J* fQ(x)dx will effect the 
desired result. 


6 It is also known as Cauchy's equidimensional equation. Euler's researches were so extensive that 
many mathematicians try to avoid confusion by naming equations, formulas, theorems, etc., 
for the person who first studied them after Euler. 
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7. Use the result of Problem 6 to discover whether each of the following 
equations can be transformed into an equation with constant coefficients 
by changing the independent variable, and solve it if this is possible: 

(a) xy" + (x 2 - l)y' + x 3 y = 0; 

(b) y" + 3xy' + x 2 y = 0. 

8. In this problem we present another way of discovering the second lin¬ 
early independent solution of (1) when the roots of the auxiliary equa¬ 
tion are real and equal. 

(a) If n/| * m 2 , verify that the differential equation 

y" - (m 1 + m^y' + = 0 

has 

y =- 

mi - m 2 

as a solution. 

(b) Think of m 2 as fixed and use THospital's rule to find the limit of the 
solution in part (a) as m 1 -* m 2 . 

(c) Verify that the limit in part (b) satisfies the differential equation 
obtained from the equation in part (a) by replacing m 2 by m 2 . 


18 The Method of Undetermined Coefficients 

In the preceding two sections we considered several ways of finding the gen¬ 
eral solution of the homogeneous equation 

y" + P(x)y' + Q(x)y = 0. (1) 

As we saw, these methods are effective in only a few special cases: when 
the coefficients P(x) and Q(x) are constants, and when they are not constants 
but are still simple enough to enable us to discover one nonzero solution 
by inspection. Fortunately these categories are sufficiently broad to cover a 
number of significant applications. However, it should be clearly understood 
that many homogeneous equations of great importance in mathematics and 
physics are beyond the reach of these procedures, and can only be solved by 
the method of power series developed in Chapter 5. 

In this and the next section we turn to the problem of solving the nonho- 
mogeneous equation 


y" + P(x)y' + Q(x)y = P(X) 


(2) 
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for those cases in which the general solution y g (x) of the corresponding 
homogeneous equation (1) is already known. By Theorem 14-B, if y p (x) is any 
particular solution of (2), then 


y(x)=y g (x) + y p (x) 

is the general solution of (2). But how do we find yf! This is the practical prob¬ 
lem that we now consider. 

The method of undetermined coefficients is a procedure for finding y p 
when (2) has the form 


y"+py' + qy = R(x), (3) 

where p and q are constants and R(x) is an exponential, a sine or cosine, a 
polynomial, or some combination of such functions. As an example, we 
study the equation 


y"+py' +qy=e ax . (4) 

Since differentiating an exponential such as e ax merely reproduces the func¬ 
tion with a possible change in the numerical coefficient, it is natural to guess 
that 


y v =Ae“ x (5) 

might be a particular solution of (4). Here A is the undetermined coefficient 
that we want to determine in such a way that (5) will actually satisfy (4). On 
substituting (5) into (4), we get 

A(a 2 + pa + q)e ax = e ax , 


so 


A = 


1 

a 2 + pa + q 


( 6 ) 


This value of A will make (5) a solution of (4) except when the denominator on 
the right of (6) is zero. The source of this difficulty is easy to understand, for the 
exception arises when a is a root of the auxiliary equation 

m 2 +pm+q = 0, (7) 

and in this case we know that (5) reduces the left side of (4) to zero and can¬ 
not possibly satisfy (4) as it stands, with the right side different from zero. 
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What can be done to continue the procedure in this exceptional case? We 
saw in the previous section that when the auxiliary equation has a double 
root, the second linearly independent solution of the homogeneous equation 
is obtained by multiplying by x. With this as a hint, we take 


y p =Axe ax 


( 8 ) 


as a substitute trial solution. On inserting (8) into (4), we get 
A(a 2 + pa + q)xe ax +A(2a + p)e ax = e ax . 

The first expression in parentheses is zero because of our assumption that a 
is a root of (7), so 


A = 


1 

2 a + p 


( 9 ) 


This gives a valid coefficient for (8) except when a = -p/2, that is, except when 
a is a double root of (7). In this case we hopefully continue the successful pat¬ 
tern indicated above and try 


y p =Ax 2 e ax . 


( 10 ) 


Substitution of (10) into (4) yields 

A(a 2 + pa + q)x 2 e ax + 1A(2a + p)xe ax + 2 Ae ax = e ax . 

Since a is now assumed to be a double root of (7), both expressions in paren¬ 
theses are zero and 


* = \- ( 11 ) 

To summarize: If a is not a root of the auxiliary equation (7), then (4) has a 
particular solution of the form Ae ax ; if a is a simple root of (7), then (4) has 
no solution of the form Ae ax but does have one of the form Axe'"; and if a is a 
double root, then (4) has no solution of the form Axe ax but does have one of 
the form Ax 2 e ax . In each case we have given a formula for A, but only for the 
purpose of clarifying the reasons behind the events. In practice it is easier to 
find A by direct substitution in the equation at hand. 

Another important case where the method of undetermined coefficients 
can be applied is that in which the right side of equation (4) is replaced by 
sin bx : 


y" +py' + qy = sin bx. 


(12) 
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Since the derivatives of sin bx are constant multiples of sin bx and cos bx, we 
take a trial solution of the form 

y v =A sin bx + B cos bx. (13) 

The undetermined coefficients A and B can now be computed by substi¬ 
tuting (13) into (12) and equating the resulting coefficients of sin bx and 
cos bx on the left and right. These steps work just as well if the right side 
of equation (12) is replaced by cos bx or any linear combination of sin bx 
and cos bx, that is, any function of the form a sin bx + p cos bx. As before, 
the method breaks down if (13) satisfies the homogeneous equation cor¬ 
responding to (12). When this happens, the procedure can be carried 
through by using 


y v = x(A sin bx + B cos bx) (14) 

as our trial solution instead of (13). 


Example 1. Find a particular solution of 

y"+y = sinx. (15) 

The reduced homogeneous equation y" + y = 0 has y=c 1 sin x+c 2 cos x as 
its general solution, so it is useless to take y p =A sin x + B cos x as a trial 
solution for the complete equation (15). We therefore try y p = x(A sin x + 
B cos x). This yields 

y p = Asinx + Bcosi' + x(Acosx-Bsinx) 


and 


y" = 2Acosx-2Bsinx + x(-Asinx-Bcosx), 


and by substituting in (15) we obtain 


2A cos x - 2B sin x = sin x. 


This tells us that the choice A = 0 and B-satisfies our requirement, so 

1 ^ 
y p = —x cos x is the desired particular solution. 


Finally, we consider the case in which the right side of equation (4) is replaced 
by a polynomial: 


y" + py' + qy = a 0 + aiX + • • • + a n x n . 


( 16 ) 
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Since the derivative of a polynomial is again a polynomial, we are led to seek 
a particular solution of the form 


y p — Aq + A\X + • • • + A n x n . (17) 

When (17) is substituted into (16), we have only to equate the coefficients of 
like powers of x to find the values of the undetermined coefficients A 0 , A v ..., 
A n . If the constant q happens to be zero, then this procedure gives x" _1 as the 
highest power of x on the left of (16), so in this case we take our trial solution 
in the form 


y P =x(A 0 + A x x + • • • + A n x n ) 

- A 0 x + Aqx 2 + ■ ■ • +A n x n+1 (18) 

If p and q are both zero, then (16) can be solved at once by direct integration. 


Example 2. Find the general solution of 

y" ~y' ~ 2y=4x 2 . (19) 

The reduced homogeneous equation y" - y' -2y =0 has m 2 - in - 2 = 0 or 
(m - 2 )(m + 1) = 0 as its auxiliary equation, so the general solution of the 
reduced equation is y g = c 2 e lx + c 2 e~ x . 

Since the right side of the complete equation (19) is a polynomial of the 
second degree, we take a trial solution of the form y p =A + Bx+Cx 2 and 
substitute it into (19): 

2C - (B + 2Cx) - 2 (A + Bx + Cx 2 ) = 4x 2 . 

Equating coefficients of like powers of x gives the system of linear 
equations 


2C-B-2h = 0, 

- 2C - 2B = 0, 

-2C=4. 

We now easily see that C = -2, B = 2, and A = -3, so our particular solution 
is y p = - 3 + 2x - 2x 2 and 


y = c x e 2x + c 2 e~ x - 3 + 2x - 2x 2 
is the general solution of the complete equation (19). 


The above discussions show that the form of a particular solution of equa¬ 
tion (3) can often be inferred from the form of the right-hand member R(x). 
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In general this is true whenever R(x) is a function with only a finite number 
of essentially different derivatives. We have seen how this works for expo¬ 
nentials, sines and cosines, and polynomials. In Problem 3 we indicate a 
course of action for the case in which R(x) is a sum of such functions. It is also 
possible to develop slightly more elaborate techniques for handling various 
products of these elementary functions, but for most practical purposes this 
is unnecessary. In essence, the whole matter is simply a question of intelli¬ 
gent guesswork involving a sufficient number of undetermined coefficients 
that can be tailored to fit the circumstances. 


Problems 

1. Find the general solution of each of the following equations: 


(a) 

y" 

+ 3y' - lOy = 6e 41 '; 

(b) 

y" 

+ Ay - 3 sin x; 

(c) 

y" 

+ lOy ' + 25y = I4e~ 5x ; 

(d) 

y" 

- 1y ' + 5y = 25x 2 3 + 12; 

(e) 

y" 

- y' - 6y = 20e~ 2x ; 

(f) 

y" 

- 3y' + 2y = 14 sin 2x 

(g) 

y" 

+ y = 2 cos x; 

(h) 

y" 

- 2y' = 12x - 10; 

(i) 

y" 

- 2y' +y=6e x ; 

(j) 

y" 

- 2y' + 2y = e x sin x; 

(k) 

y" 

+ y' = 10x 4 + 2. 


2. If k and b are positive constants, find the general solution of 

y" + k 2 y = sin bx. 

3. If yfx) and y 2 (x) are solutions of 

y" + P{x)y' + Q(x)y=R 1 (x) 

and 

y" + P{x)y' + Q(x)y = R 2 (x), 
show that y(x) = yfx) + y 2 (x) is a solution of 


y" + P(x)iy' + Q(x)y = Rfx) + R 2 (x). 
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This is called the principle of superposition. Use this principle to find the 
general solution of 

(a) y" + Ay - 4 cos 2x + 6 cos x + 8x 2 - 4x; 

(b) y" + 9y-2 sin 3x + 4 sin x - 26e~ 2x + 2 7x 3 . 


19 The Method of Variation of Parameters 

The technique described in Section 18 for determining a particular solution 
of the nonhomogeneous equation 

y" + P(x)y' + Q(x)y = R(x) (1) 

has two severe limitations: it can be used only when the coefficients P(x) and 
Q(x) are constants, and even then it works only when the right-hand term 
R(x) has a particularly simple form. Within these limitations, however, this 
procedure is usually the easiest to apply. 

We now develop a more powerful method that always works—regardless 
of the nature of P, Q, and R —provided only that the general solution of the 
corresponding homogeneous equation 

y" + P(x)y' + Q(x)y=0 (2) 

is already known. We assume, then, that in some way the general solution 

yW = CiJ/ 1 (x) + c 2 y 2 (x) (3) 

of (2) has been found. The method is similar to that discussed in Section 16; 
that is, we replace the constants c 1 and c 2 by unknown functions vfx) and 
v 2 (x), and attempt to determine v t and v 2 in such a manner that 


y = v$ l + v 2 y 2 (4) 

will be a solution of (l). 7 With two unknown functions to find, it will be nec¬ 
essary to have two equations relating these functions. We obtain one of these 
by requiring that (4) be a solution of (1). It will soon be clear what the second 
equation should be. We begin by computing the derivative of (4), arranged 
as follows: 


y' = (v x y\ + v 2 y' 2 ) + {v'pjr + v' 2 y 2 ). 


( 5 ) 


7 This is the source of the name variation of parameters: we vary the parameters Cj and c 2 . 
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Another differentiation will introduce second derivatives of the unknowns 
v x and v 2 . We avoid this complication by requiring the second expression in 
parentheses to vanish: 


v[y 1 +v 2 y 2 = 0. 


( 6 ) 


This gives 


y' = v 1 y[+v 2 y 2l 


( 7 ) 


so 


y" = v x y\ + v\y\ + v 2 y\ + v' 2 y' 2 . (8) 

On substituting (4), (7), and (8) into (1), and rearranging, we get 

vfy” + Py'i + Qyi) + v 2 (y 2 + Py 2 + Qy 2 ) + v\y\ + v' 2 y' 2 = R(x). (9) 

Since y ] and y 2 are solutions of (2), the two expressions in parentheses are 
equal to 0, and (9) collapses to 


v[y' 1 + v' 2 y 2 =R(x). (10) 

Taking (6) and (10) together, we have two equations in the two unknowns v\ 
and v 2 : 


v[y x + v 2 y 2 = 0, 
v[y[ + v' 2 y 2 = R(x). 


These can be solved at once, giving 


v , _ -y 2 R(x) 

1 W(y u y 2 ) 


and 


v , _ yiR(x) 

2 myi,y 2 )‘ 


(ii) 


It should be noted that these formulas are legitimate, for the Wronskian in 
the denominators is nonzero by the linear independence of y x and y 2 . All that 
remains is to integrate formulas (11) to find v x and v 2 - 


. r -j/JM dx and r 
J W<u„u 2 ) - J 


y i R ( x ) 

W(yi,y 2 )' 


dx. 


( 12 ) 


We can now put everything together and assert that 




yiR(x) 

W(yi,y 2 )' 


dx 


(13) 


is the particular solution of (1) we are seeking. 
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The reader will see that this method has disadvantages of its own. In par¬ 
ticular, the integrals in (12) may be difficult or impossible to work out. Also, 
of course, it is necessary to know the general solution of (2) before the process 
can even be started; but this objection is really immaterial because we are 
unlikely to care about finding a particular solution of (1) unless the general 
solution of (2) is already at hand. 

The method of variation of parameters was invented by the French math¬ 
ematician Lagrange in connection with his epoch-making work in analytical 
mechanics (see Appendix A in Chapter 12). 

Example 1. Find a particular solution of y" + y = esc x. 

The corresponding homogeneous equation y" + y = 0 has i/(x) = c 1 sin x + 
c 2 cos x as its general solution, so y 1 = sin x, y\ = cos x, y 2 = cos x, and 
1/2 = -sinx. The Wronskian of iq and i/ 2 is 


W(j/i,y 2 ) = yiy'i - yiy'i = - sin * 1 2 x - cos 2 x = -1, 


so by (12) we have 


f-cosxcscx, fcosx , , ,. , 

Vi = - ax = - ax = iog(smx) 

J -1 J sinx 


-cosxcscx 


and 



Accordingly, 


y = sin x log (sin x) - x cos x 


is the desired particular solution. 


Problems 

1. Find a particular solution of 


y" -2i/ + ij = 2x, 


first by inspection and then by variation of parameters. 

2. Find a particular solution of 

y" -y' -6y=e-*, 


first by undetermined coefficients and then by variation of parameters. 
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3. Find a particular solution of each of the following equations: 

(a) y” + Ay = tan 2x; 

(b) y" + 2y' + y = e~ x log x; 

(c) y" -2y' -3y = 64:xe- x ; 

(d) y" + 2y' + 5y = e _ *sec 2x; 

(e) 2y" + 3y'+y = e _3 h 

(f) y"-3y' + 2y = (l+e-")- 1 . 

4. Find a particular solution of each of the following equations: 

(a) y" + y = sec x; 

(b) y" + y = cot 2 x; 

(c) y" + y = cot 2x; 

(d) y" + y = xcosx; 

(e) y" + y = tanx; 

(f) y" + y- sec x tan x; 

(g) y" + y — sec x esc x. 

5. (a) Show that the method of variation of parameters applied to the 

equation y" + y =/(x) leads to the particular solution 

X 

y p (x) = j/(f)sin(x-f)df. 

o 

(b) Find a similar formula for a particular solution of the equation 
y" + k 2 y =f(x), where A: is a positive constant. 

6. Find the general solution of each of the following equations: 

(a) (x 2 - l)y" - 2xy'' + 2y = (x 2 - l) 2 ; 

(b) (x 2 + x)y" + (2 - x 2 )y' - (2 + x)y = x(x +1) 2 ; 

(c) (1 -x)y" + xy' -y=(l - x) 2 ; 

(d) xy" - (1 +x)y' + y-x 2 e 2x ', 

(e) x 2 y " - 2xy' + 2y = xe _I 


20 Vibrations in Mechanical and Electrical Systems 

Generally speaking, vibrations occur whenever a physical system in stable 
equilibrium is disturbed, for then it is subject to forces tending to restore its 
equilibrium. In the present section we shall see how situations of this kind 
can lead to differential equations of the form 
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and also how the study of these equations sheds light on the physical 


circumstances. 

Undamped simple harmonic vibrations. As a continuing example, we 
consider a cart of mass M attached to a nearby wall by means of a spring 
(Figure 25). The spring exerts no force when the cart is at its equilibrium 
position x = 0. If the cart is displaced by a distance x, then the spring exerts 
a restoring force F s =-kx, where A: is a positive constant whose magnitude is 
a measure of the stiffness of the spring. By Newton's second law of motion, 
which says that the mass of the cart times its acceleration equals the total 
force acting on it, we have 



( 1 ) 


or 



dt 2 M 


( 2 ) 


It will be convenient to write this equation of motion in the form 



( 3 ) 


where a = , Jk/M , and its general solution can be written down at once: 


x = c 1 sin at + c 2 cos at. 


( 4 ) 



X 


FIGURE 25 
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If the cart is pulled aside to the position x - x 0 and released without any ini¬ 
tial velocity at time t = 0, so that our initial conditions are 

dx 

x = x 0 and v = — = 0 when t = 0, (5) 

dt 

then it is easily seen that c 1 - 0 and c 2 =x 0 , so (4) becomes 

x-x 0 cos at. (6) 

The graph of (6) is shown in Figure 26. The amplitude of this simple harmonic 
vibration is x 0 ; and since its period T is the time required for one complete 
cycle, we have aT = 2% and 



Its frequency f is the number of cycles per unit time, so fT = 1 and 

, i_ a _ i nr 

* ~ T ~ 2n ~2n\lM' 


( 7 ) 


( 8 ) 


It is clear from (8) that the frequency of this vibration increases if the stiffness 
of the spring is increased or if the mass of the cart is decreased, as our com¬ 
mon sense would have led us to predict. 


Damped vibrations. As our next step in developing this physical problem, 
we consider the additional effect of a damping force F d due to the viscos¬ 
ity of the medium through which the cart moves (air, water, oil, etc.). We 



FIGURE 26 
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make the specific assumption that this force opposes the motion and has 
magnitude proportional to the velocity, that is, that F cl = -c(dx/dt), where c 
is a positive constant measuring the resistance of the medium. Equation (1) 
now becomes 



( 9 ) 


so 



c dx k n 

-+ —x = 0. 

M dt M 


( 10 ) 


Again for the sake of convenience, we write this in the form 



( 11 ) 


where b = c/IM and a = ^Jk/M. The auxiliary equation is 

m 2 + 2 bm + a 2 - 0, 


( 12 ) 


and its roots m 1 and m 2 are given by 


■2b ± V# 2 - 4a 2 
2 


(13) 


m 1 ,m 2 


I he general solution of (11) is of course determined by the nature of the 
numbers m 1 and m 2 . As we know, there are three cases, which we consider 
separately. 

CASE A. b 2 - a 2 > 0 or b > a. In loose terms this amounts to assuming that 
the frictional force due to the viscosity is large compared to the stiffness of 
the spring. It follows that m 1 and m 2 are distinct negative numbers, and the 
general solution of (11) is 


x = Cl e mt + c 2 e mt . 


(14) 


If we apply the initial conditions (5) to evaluate c 1 and c 2 , (14) becomes 


(: m 1 e mit -rn 2 e mt ). 


(15) 


nil — m 2 
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X 



FIGURE 27 


The graph of this function is given in Figure 27. It is clear that no vibration 
occurs, and that the cart merely subsides to its equilibrium position. This 
type of motion is called overdamped. We now imagine that the viscosity is 
decreased until we reach the condition of the next case. 

CASE B. b 2 - a 2 -0 or b = a. Here we have m 1 = m 2 = -b = -a, and the general 
solution of (11) is 


x = c r e~ al + c 2 te~ at . 


(16) 


When the initial conditions (5) are imposed, we obtain 


x = x 0 e~ at (l + at). 


(17) 


This function has a graph similar to that of (15), and again we have no vibra¬ 
tion. Any motion of this kind is said to be critically damped. If the viscosity 
is now decreased by any amount, however small, then the motion becomes 
vibratory, and is called underdamped. This is the really interesting situation, 
which we discuss as follows. 

CASE C. b 2 - a 2 < 0 or b < a. Here m 1 and m 2 are conjugate complex numbers 
-b ± ia, where 


a = 



and the general solution of (11) is 


x = e~ ii {c l cos at + c 2 sin at). 


(18) 
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When c 1 and c 2 are evaluated in accordance with the initial conditions (5), 
this becomes 


x = — e fcf (acosai + bsinai). (19) 

a 

If we introduce 0 = tan* 1 ( b/a ), then (19) can be expressed in the more reveal¬ 
ing form 


x = 


x 0 Va 2 +b 2 


e ’'cos(af-0). 


( 20 ) 


This function oscillates with an amplitude that falls off exponentially, as 
Figure 28 shows. It is not periodic in the strict sense, but its graph crosses the 
equilibrium position x = 0 at regular intervals. If we consider its "period" T as 
the time required for one complete "cycle," then aT = 2k and 


2 k _ 27 i _ 2n 

\la 2 -b 2 ^ Jk/M-c 2 /AM 2 


( 21 ) 


Also, its "frequency"/is given by 


/ = 


1 

T 



J_ Ik c 2 
2k V M 4M 2 ‘ 


( 22 ) 


This number is usually called the natural frequency of the system. When 
the viscosity vanishes, so that c = 0, it is clear that (21) and (22) reduce to (7) 
and (8). Furthermore, on comparing (8) and (22) we see that the frequency of 
the vibration is decreased by damping, as we might expect. 



FIGURE 28 
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Forced vibrations. The vibrations discussed above are known as free vibra¬ 
tions because all the forces acting on the system are internal to the system 
itself. We now extend our analysis to cover the case in which an impressed 
external force F e =f(t) acts on the cart. Such a force might arise in many ways: 
for example, from vibrations of the wall to which the spring is attached, or 
from the effect on the cart of an external magnetic field (if the cart is made of 
iron). In place of (9) we now have 



(23) 


so 



(24) 


The most important case is that in which the impressed force is periodic and 
has the form f(t)-F 0 cos cot, so that (24) becomes 



(25) 


We have already solved the corresponding homogeneous equation (10), so 
in seeking the general solution of (25) all that remains is to find a particular 
solution. This is most readily accomplished by the method of undetermined 
coefficients. Accordingly, we take x=A sin cof + B cos cot as a trial solution. 
On substituting this into (25), we obtain the following pair of equations for 
A and B: 


co cA + (k- co 2 M)B = F 0 , 


(k-co 2 M)A-cocB = 0. 


The solution of this system is 



and B = 


(k - co 2 M)F 0 


(k - © 2 M) 2 + ore 2 


Our desired particular solution is therefore 


x = 


(k - ro 2 M) 2 + ro 2 c 2 


[roc sin rot + (k - ro 2 M) cos rot]. 


(26) 
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By introducing 4> = tan _1 [a)c/(fc - a) 2 M)], we can write (26) in the more useful 
form 


x = 


yj(k - co 2 M) 2 + k>V 


-cos(o)f — ()>)• 


(27) 


If we now assume that we are dealing with the underdamped motion dis¬ 
cussed above, then the general solution of (25) is 


x = e ht (ciCOsat + c 2 sinaf)+ ,- 0 ^=cos(mf -())). (28) 

y](k - co 2 M) 2 + co 2 c 2 

The first term here is clearly transient in the sense that it approaches 0 as 
t As a matter of fact, this is true whether the motion is underdamped 
or not, as long as some degree of damping is present (see Problem 17-2). 
Therefore, as time goes on, the motion assumes the character of the sec¬ 
ond term, the steady-state part. On this basis, we can neglect the transient 
part of (28) and assert that for large t the general solution of (25) is essen¬ 
tially equal to the particular solution (27). The frequency of this forced 
vibration equals the impressed frequency a/2n, and its amplitude is the 
coefficient 


, , , (29) 

■yj(k - co 2 M) 2 + co 2 c 2 

This expression for the amplitude holds some interesting secrets, for it 
depends not only on co and F 0 but also on k, c, and M. As an example, we 
note that if c is very small and co is close to yjk/M (so that k - co 2 M is very 
small), which means that the motion is lightly damped and the impressed 
frequency co/2it is close to the natural frequency 


JL k c 2 

271 V M 4M 2 ' 

then the amplitude is very large. This phenomenon is known as resonance. 
A classic example is provided by the forced vibration of a bridge under the 
impact of the feet of marching columns of men whose pace corresponds 
closely to the natural frequency of the bridge. 

Finally, we mention briefly certain links between the mechanical prob¬ 
lem treated above and the electrical problem discussed in Section 13. 
It was shown in that section that if a periodic electromotive force E = E 0 cos at 
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acts in a simple circuit containing a resistor, an inductor, and a capacitor, 
then the charge Q on the capacitor is governed by the differential equation 



(30) 


This equation is strikingly similar to (25). In particular, the following cor¬ 
respondences suggest themselves: 


mass M <-> inductance L; 
viscosity c <-» resistance R; 



displacement x charge Q on capacitor. 


This analogy between the mechanical and electrical systems renders iden¬ 
tical the mathematics of the two systems, and enables us to carry over at 
once all mathematical conclusions from the first to the second. In the given 
electric circuit we therefore have a critical resistance below which the free 
behavior of the circuit will be vibratory with a certain natural frequency, a 
forced steady-state vibration of the charge Q, and resonance phenomena that 
appear when the circumstances are favorable. 


Problems 

1. Consider the forced vibration (27) in the underdamped case, and find the 
impressed frequency for which the amplitude (29) attains a maximum. 
Will such an impressed frequency necessarily exist? This value of the 
impressed frequency (when it exists) is called the resonance frequency. 
Show that it is always less than the natural frequency. 

2. Consider the underdamped free vibration described by formula (20). 
Show that x assumes maximum values for t = 0,T, 2 T r ..., where T is the 
"period" as given in formula (21). If x 1 and x 2 are any two successive 
maximum values of x, show that x l /x 2 -e bT . The logarithm of this quan¬ 
tity, bT, is known as the logarithmic decrement of the vibration. 

3. A spherical buoy of radius r floats half-submerged in water. If it is 
depressed slightly, a restoring force equal to the weight of the dis¬ 
placed water presses it upward; and if it is then released, it will bob up 
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and down. Find the period of oscillation if the friction of the water is 
neglected. 

4. A cylindrical buoy 2 feet in diameter floats with its axis vertical in fresh 
water of density 62.4 lb/ft 3 . When depressed slightly and released, its 
period of oscillation is observed to be 1.9 seconds. What is the weight 
of the buoy? 

5. Suppose that a straight tunnel is drilled through the earth between any 
two points on the surface. If tracks are laid, then—neglecting friction— 
a train placed in the tunnel at one end will roll through the earth under 
its own weight, stop at the other end, and return. Show that the time 
required for a complete round trip is the same for all such tunnels, and 
estimate its value. If the tunnel is 2L miles long, what is the greatest 
speed attained by the train? 

6. The cart in Figure 25 weighs 128 pounds and is attached to the wall by 
a spring with spring constant A; = 64 lb/ft. The cart is pulled 6 inches in 
the direction away from the wall and released with no initial velocity. 
Simultaneously a periodic external force F e =f(t) =32 sin At is applied 
to the cart. Assuming that there is no air resistance, find the position 
x-x(t) of the cart at time f. Note particularly that | x(t) | has arbitrarily 
large values as t -> °°, a phenomenon known as pure resonance and 
caused by the fact that the forcing function has the same period as the 
free vibrations of the unforced system. 

7. (This problem is intended only for students who are not intimidated 
by calculations with complex numbers.) The correspondence between 
equations (25) and (30) makes it easy to write down the steady-state 
solution of (30) by merely changing the notation in (27): 



Q = 


cos(a>f - (|)), 


(*) 


where tan (j) = coi?/(l/C - co 2 L). In electrical engineering it is customary 
to think of E 0 cos cot in (30) as the real part of E 0 e iat , and instead of (30) 
we would then consider the differential equation 



Find a particular solution of this equation by the method of unde¬ 
termined coefficients, and at the end of the calculation take the real 
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part of this solution and thereby obtain the solution (*) of the differ¬ 
ential equation (30). 8 


21 Newton's Law of Gravitation and The Motion of the Planets 

The inverse square law of attraction underlies so many natural phe¬ 
nomena—the orbits of the planets around the sun, the motion of the moon 
and artificial satellites about the earth, the paths described by charged 
particles in atomic physics, etc.—that every person educated in science 
ought to know something about its consequences. Our purpose in this sec¬ 
tion is to deduce Kepler's laws of planetary motion from Newton's law of 
universal gravitation, and to this end we discuss the motion of a small 
particle of mass m (a planet) under the attraction of a fixed large particle of 
mass M (the sun). 


The use of complex numbers in the mathematics of electric circuit problems was pioneered 
by the mathematician, inventor and electrical engineer Charles Proteus Steinmetz (1865- 
1923). As a young man in Germany, his student socialist activities got him into trouble with 
Bismarck's police, and he hastily emigrated to America in 1889. He was employed by the 
General Electric Company in its earliest period, and he quickly became the scientific brains 
of the Company and probably the greatest of all electrical engineers. When he came to GE 
there was no way to mass-produce electric motors or generators, and no economically viable 
way to transmit electric power more than 3 miles. Steinmetz solved these problems by using 
mathematics and the power of his own mind, and thereby improved human life forever in 
ways too numerous to count. 

He was a dwarf who was crippled by a congenital deformity and lived with pain, but he 
was universally admired for his scientific genius and loved for his warm humanity and 
puckish sense of humor. The following little-known but unforgettable anecdote about him 
was published in the Letters section of Life magazine (May 14,1965): 

Sirs: In your article on Steinmetz (April 23) you mentioned a consultation with 
Henry Ford. My father, Burt Scott, who was an employee of Henry Ford for many 
years, related to me the story behind that meeting. Technical troubles developed 
with a huge new generator at Ford's River Rouge plant. His electrical engineers 
were unable to locate the difficulty so Ford solicited the aid of Steinmetz. When 
"the little giant" arrived at the plant, he rejected all assistance, asking only for a 
notebook, pencil and cot. For two straight days and nights he listened to the gen¬ 
erator and made countless computations. Then he asked for a ladder, a measur¬ 
ing tape and a piece of chalk. He laboriously ascended the ladder, made careful 
measurements, and put a chalk mark on the side of the generator. He descended 
and told his skeptical audience to remove a plate from the side of the generator 
and take out 16 windings from the field coil at that location. The corrections were 
made and the generator then functioned perfectly. Subsequently Ford received 
a bill for $10,000 signed by Steinmetz for G.E. Ford returned the bill acknowl¬ 
edging the good job done by Steinmetz but respectfully requesting an itemized 
statement. Steinmetz replied as follows: Making chalk mark on generator $1. 
Knowing where to make mark $9,999. Total due $10,000. 
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FIGURE 29 


For problems involving a moving particle in which the force acting on it 
is always directed along the line from the particle to a fixed point, it is usu¬ 
ally simplest to resolve the velocity, acceleration, and force into components 
along and perpendicular to this line. We therefore place the fixed particle M 
at the origin of a polar coordinate system (Figure 29) and express the radius 
vector from the origin to the moving particle m in the form 

r = ru„ (1) 

where u,, is the unit vector in the direction of r. 9 It is clear that 

u,. = i cos 0 + j sin 0, (2) 

and also that the corresponding unit vector Ug, perpendicular to u r in the 
direction of increasing 0, is given by 

u e = -i sin 0 + j cos 0. (3) 


The simple relations 


du 

~dQ 


— r - = u e and -= -u 


du e 

dQ 


obtained by differentiating (2) and (3), are essential for computing the veloc¬ 
ity and acceleration vectors v and a. Direct calculation from (1) now yields 


9 We here adopt the usual convention of signifying vectors by boldface type. 
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dr du r 

v = — = r -+ u, 

dt dt 


dr du r d0 dr d8 dr 

— = r -+ u r — = r — Ue + —u r 

dt dQ dt dt dt dt 


( 4 ) 


and 


dv 

( d 2 0 

_ dr rf0 


d 2 r 

CN 

a = — = 

r — 7T + L - 

u e + 

—^-r — 

dt 

^ dt 2 

dt dt 

dt L 

\dt) 


( 5 ) 


If the force F acting on m is written in the form 

F=F(,u e +F r u r/ (6) 

then from (5) and (6) and Newton's second law of motion ma = F, we get 


/ 

m r 

v 


d 2 0 2 dr d0 N 
dt 2 dt dt y 




and 




( 7 ) 


These differential equations govern the motion of the particle m, and are 
valid regardless of the nature of the force. Our next task is to extract infor¬ 
mation from them by making suitable assumptions about the direction and 
magnitude of F. 


Central forces and Kepler's Second Law. F is called a central force if it has 
no component perpendicular to r, that is, if F e - 0. Under this assumption the 
first of equations (7) becomes 


d 2 0 . dr d0 

T -1- 2- 

dt 2 dt dt 


On multiplying through by r, we obtain 


d 2 0 , dr dQ 

- 1 - 2 r - 

dt 2 dt dt 


or 


±f 2 de] 
dt\ dt) 


= 0 , 



so 


(8) 
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for some constant h. We shall assume that h is positive, which evidently 
means that m is moving in a counterclockwise direction. If A-A(t) is the 
area swept out by r from some fixed position of reference, so that dA = r 2 dQ/2, 
then (8) implies that 


dA = 



-hdt. 
2 


On integrating (9) from t 1 to f 2 , we get 


( 9 ) 


A(t 2 )-A(h) = h(t 2 -h). (10) 

This yields Kepler's second law: the radius vector r from the sun to a planet 
sweeps out equal areas in equal intervals of time. 10 

Central gravitational forces and Kepler's First Law. We now specialize even 
further, and assume that F is a central attractive force whose magnitude— 
according to Newton's law of gravitation—is directly proportional to the 
product of the two masses and inversely proportional to the square of the 
distance between them: 


F, = -G 


Mm 


( 11 ) 


The letter G represents the gravitational constant, which is one of the universal 
constants of nature. If we write (11) in the slightly simpler form 


F, = 


km 


where k = GM, then the second of equations (7) becomes 


d 2 r (dQ 


dt v dt 


r — =—t ■ 


( 12 ) 


10 When the Danish astronomer Tycho Brahe died in 1601, his assistant Johannes Kepler (1571- 
1630) inherited great masses of raw data on the positions of the planets at various times. 
Kepler worked incessantly on this material for 20 years, and at last succeeded in distilling 
from it his three beautifully simple laws of planetary motion—which were the climax of 
thousands of years of purely observational astronomy. 
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The next step in this line of thought is difficult to motivate, because it involves 
considerable technical ingenuity, but we will try. Our purpose is to use the 
differential equation (12) to obtain the equation of the orbit in the polar form 
r =/(0), so we want to eliminate t from (12) and consider 0 as the independent 
variable. Also, we want r to be the dependent variable, but if (8) is used to put 
(12) in the form 


d 2 r h 2 _ k 


(13) 


then the presence of powers of 1/r suggests that it might be temporarily con¬ 
venient to introduce a new dependent variable z = 1/r. 

To accomplish these various aims, we must first express d 2 r/dt 2 in terms of 
d 2 z/dQ 2 , by calculating 

dr d f 1 h 1 dz _ 1 dz dd 1 dz h _ , dz 

dt dt vzj z 2 dt z 2 dQ dt z 2 d8 r 2 dO 


and 


d 2 r d f dz^t _ d (dz^dQ d 2 z h _ , 2 2 d 2 z 

dt 2 dtydQj dQydQJdt dQ 2 r 2 dQ 2 


When the latter expression is inserted in (13) and 1/r is replaced by z, 
we get 

-h 2 z 2d ^-h 2 z 3 = -kz 2 
dQ 2 

or 


d 2 z k 

d0 2 h 2 

The general solution of this equation can be written down at once: 

z = Asin0 + Bcos0 + --5v (14) 

h 

For the sake of simplicity, we shift the direction of the polar axis in such 
a way that r is minimal (that is, m is closest to the origin) when 0 = 0. This 
means that z is to be maximal in this direction, so 
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dz , d 2 z n 

— = 0 and — T < 0 
dQ dQ 2 


when 0 = 0. These conditions imply that A = 0 and B > 0. If we now replace z 
by 1/r, then (14) can be written 


r = 


h 2 /k 


k/h 2 + Bcos0 l + (Bh 2 jk) cos 0' 


and if we put e - Bh 2 /k, then our equation for the orbit becomes 

h 2 /k 


r = 


1 + e cos 0 


(15) 


where e is a positive constant. 

At this point we recall (Figure 30) that the locus defined by PF/PD - e is the 
conic section with focus F, directrix d, and eccentricity e. When this condition 
is expressed in terms of r and 0, it is easy to see that 


l + ecos0 

is the polar equation of our conic section, which is an ellipse, a parabola, or 
a hyperbola according as e < 1, e-1, or e > 1. These remarks show that the 
orbit (15) is a conic section with eccentricity e=Bh 2 /k ; and since the planets 
remain in the solar system and do not move infinitely far away from the sun, 
the ellipse is the only possibility. This yields Kepler's first law: the orbit of each 
planet is an ellipse with the sun at one focus. 



FIGURE 30 
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The physical meaning of the eccentricity. It follows from equation (4) that 
the kinetic energy of m is 


1 

2 


mv 2 



(16) 


The potential energy of the system is the negative of the work required to 
move m to infinity (where the potential energy is zero), and is therefore 



r 


If E is the total energy of the system, which is constant by the principle of 
conservation of energy, then (16) and (17) yield 


1 2 

m r 

2 




= £. 


(18) 


At the instant when 0 = 0, (15) and (18) give 


h 2 /k 

r =- 

1 + e 


and 


mr 2 h 2 km _ r 


It is easy to eliminate r from these equations; and when the result is solved 
for e, we find that 


e = 1 + E 

f 2h 2 } 

V 

{ mk 2 J 


This enables us to write equation (15) for the orbit in the form 

h 2 /k 

r = - , ' -. 

1 + f 1 + E(2h 2 /mk 2 ) cos0 


(19) 


It is evident from (19) that the orbit is an ellipse, a parabola, or a hyperbola 
according as £ < 0, E = 0, or £ > 0. It is therefore clear that the nature of the 
orbit of m is completely determined by its total energy £. Thus the planets 
in the solar system have negative energies and move in ellipses, and bodies 
passing through the solar system at high speeds have positive energies and 
travel along hyperbolic paths. It is interesting to realize that if a planet like 
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the earth could be given a push from behind, sufficiently strong to speed it 
up and lift its total energy above zero, it would enter into a hyperbolic orbit 
and leave the solar system permanently. 


The periods of revolution of the planets and Kepler's Third Law. We now 

restrict our attention to the case in which m has an elliptic orbit (Figure 31) 
whose polar and rectangular equations are (15) and 


It is well known from elementary analytic geometry that e-c/a and c 2 -a 2 - b 2 , 
so e 2 -(a 2 - b 2 )/a 2 and 


b 2 =a 2 { 1-e 2 ). (20) 

In astronomy the semimajor axis of the orbit is called the mean distance, 
because it is one-half the sum of the least and greatest values of r, so (15) and 
(20) give 


a = 


f h 2 /k 
1 + e 


h 2 /k' 

1-e, 


h 2 _ h 2 a 2 
k( 1-e 2 ) ~ kb 2 ' 


and we have 


b 2 = 


h 2 a 

IT’ 


( 21 ) 



FIGURE 31 
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If T is the period of m (that is, the time required for one complete revolution 
in its orbit), then since the area of the ellipse is mb it follows from (10) that 
mb=hT/1. In view of (21), this yields 


T z = 


An * 1 2 a 2 b 2 

h 2 


An' 

k 


2 A 


( 22 ) 


In the present idealized treatment, the constant k-GM depends on the cen¬ 
tral mass M but not on m, so (22) holds for all the planets in our solar system 
and we have Keplers' third law: the squares of the periods of revolution of the 
planets are proportional to the cubes of their mean distances. 

The ideas of this section are of course due primarily to Newton 
(Appendix B). However, the arguments given here are quite different from 
those that were used in print by Newton himself, for he made no explicit 
use of the methods of calculus in any of his published works on physics or 
astronomy. For him calculus was a private method of scientific investigation 
unknown to his contemporaries, and he had to rewrite his discoveries into 
the language of classical geometry whenever he wished to communicate 
them to others. 


Problems 

1. In practical work with Kepler's third law (22), it is customary to measure 
T in years and a in astronomical units (1 astronomical unit = the earth's 
mean distance = 93,000,000 miles = 150,000,000 kilometers). With these 
convenient units of measurement, (22) takes the simpler form T 2 = a 3 . 
What is the period of revolution T of a planet whose mean distance 
from the sun is 

(a) twice that of the earth? 

(b) three times that of the earth? 

(c) twenty-five times that of the earth? 

2. (a) Mercury's "year" is 88 days. What is Mercury's mean distance from 

the sun? 

(b) The mean distance of the planet Saturn is 9.54 astronomical units. 
What is Saturn's period of revolution about the sun? 

3. Kepler's first two laws, in the form of equations (8) and (15), imply 
that m is attracted toward the origin with a force whose magnitude is 
inversely proportional to the square of r. This was Newton's fundamen¬ 
tal discovery, for it caused him to propound his law of gravitation and 
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investigate its consequences. Prove this by assuming (8) and (15) and 
verifying the following statements: 

(a) F b = 0; 
dr ke 

(b) — = — sin0; 
at h 

,. d 2 r ke cos0 

(c) d?’— ] 

(d) 

r r 

4. Show that the speed v of a planet at any point of its orbit is given by 


v 2 =k 


2 1 


5. Suppose that the earth explodes into fragments which fly off at 
the same speed in different directions into orbits of their own. Use 
Kepler's third law and the result of Problem 4 to show that all frag¬ 
ments that do not fall into the sun or escape from the solar system 
will reunite later at the same point where they began to diverge. 


22 Higher Order Linear Equations. 

Coupled Harmonic Oscillators 

Even though the main topic of this chapter is second order linear equations, 
there are several aspects of higher order linear equations that make it worth¬ 
while to discuss them briefly. 

Most of the ideas and methods described in Sections 14 to 19 are easily 
extended to nth order linear equations with constant coefficients, 

y (n) + Hy {n ~ l) + ■ ■ • + a n _! y' + a n y =f(x), (1) 

where f(x ) is assumed to be continuous on an interval [a,b]. The basic 
fact to keep in mind is that the general solution of (1) has the form we 
expect. 


y(x)=y g (x)+y p (x), 

where y p (x) is any particular solution of (1) and y (x) is the general solution of 
the reduced homogeneous equation 
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y (n) ++ ■ ■ • + fl-i y'+«„ y=o. 


( 2 ) 


The proof is exactly the same as the proof for the case n = 2, and will not be 
repeated. 

We begin by considering the problem of finding the general solution of the 
homogeneous equation (2). Our experience with the case n = 2 tells us that 
this equation probably has solutions of the form y = e rx for suitable values of 
the constant r. By substituting y=e rx and its derivatives into (2) and dividing 
out the nonzero factor e rx , we obtain the auxiliary equation 


r n + a 1 r”~ 1 + • ■ ■ + a n _pr + a n - 0. 


(3) 


The polynomial on the left side of (3) is called the auxiliary polynomial; in prin¬ 
ciple it can always be factored completely into a product of n linear factors, 
and equation (3) can then be written in the factored form 


(r-rjXr-rj)--- (r-r„) = 0. 


The constants r v r 2r .., r n are the roots of the auxiliary equation (3). If these 
roots are distinct from one another, then we have n distinct solutions 



(4) 


of the homogeneous equation (2). Just as in the case n = 2, the linear 
combination 


y(x) = C\e nx + Coe' 21 + • • • + c n e r " x 


(5) 


is also a solution for every choice of the coefficients c v c 2/ ..., c n . 

Since (5) contains n arbitrary constants, we have reasonable grounds for 
hoping that it is the general solution of the nth order equation (2). To elevate 
this hope into a certainty, we must appeal to a small body of theory that we 
now sketch very briefly. 

When the theorems of Sections 14 and 15 are extended in the natural way, 
it can be proved that (5) is the general solution of (2) if the solutions (4) are 
linearly independent. 11 There are several ways of establishing the fact that 
the solutions (4) are linearly independent whenever the roots r v r 2 ,..., r n are 


11 This requires establishing the same connections as before among (1) satisfying n initial con¬ 
ditions, (2) the nonvanishing of the Wronskian, (3) Abel's formula, and (4) linear indepen¬ 
dence. A set of n functions i/j(x), y 2 (x),..., y„{x) is said to be linearly dependent if one of them 
can be expressed as a linear combination of the others, and linearly independent if this is not 
possible. In specific cases this is usually easy to decide by inspection. Equivalently, linear 
dependence means that there exists a relation of the form c 1 y 1 (x) + c 2 y 2 (x) + ••• + c„y n (x) = 0 in 
which at least one of the c's is not zero, and linear independence means that any such rela¬ 
tion implies that all the c's must be zero. 
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distinct, but we omit the details. It therefore follows that (5) actually is the 
general solution of (2) in this case. 


Repeated real roots. If the real roots of (3) are not all distinct, then the solu¬ 
tions (4) are linearly dependent and (5) is not the general solution. For exam¬ 
ple, if r, = r 2 then the part of (5) consisting of c k e nx + c 2 e nx becomes (ci + c 2 )e nx , 
and the two constants c x and c 2 become one constant c, + c 2 . To see what to do 
when this happens, we recall that in the special case of the second order equa¬ 
tion, where we had only the two roots r 1 and r 2 , we found that when r 1 = r 2 the 
solution c r e nx + c 2 e nx had to be replaced by c k e nx + c 2 xe nx = (c x + c 2 x)e nx . It can 
be verified by substitution that if r 1 = r 2 for the «th order equation (2), then the 
first two terms of (5) must be replaced by this same expression. 

More generally, if r 1 = r 2 = — = r k is a real root of multiplicity k (that is, a fc-fold 
repeated root) of the auxiliary equation (3), then the first k terms in the solu¬ 
tion (5) must be replaced by 

(ci + c 2 x + c 3 x 2 + • • • + q c x k ~ 1 )e nx . 

A similar family of solutions is needed for each multiple real root, giving a 
correspondingly modified form of (5). In the next section we will show how 
to obtain these expressions by operator methods. 


Complex roots. Some of the roots of the auxiliary equation (3) may be com¬ 
plex numbers. Since the coefficients of (3) are real, all complex roots occur in 
conjugate complex pairs a + ib and a - ib. As in the case n = 2, the part of the 
solution (5) corresponding to two such roots can be written in the alternative 
real form 


e ax (A cos bx + B sin bx). 

If a + ib and a - ib are roots of multiplicity k, then we must take 

e ax [(A x + A 2 x + ■ ■ ■ + A k x k ~ l ) cos bx 
+ (B 1 + B 2 x + • • • + B k x k_1 ) sin bx] 

as part of the general solution. 


Example 1. The differential equation 


y< 4 > - 5i/"+4y = 0 


has auxiliary equation 

r 4 - 5r 2 +4 = (r 2 - l)(r 2 - 4) = (r - l)(r + l)(r - 2)(r + 2) = 0. 
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Its general solution is therefore 


y = c 3 e x + c 2 e~ x +c 3 e 2x +c 4 e~ 2x . 


Example 2. The equation 


y (4> - 8 y" + 16y = 0 


has auxiliary equation 


r 4 - 8r 2 +16 = (r 2 - 4) 2 = (r - 2) 2 (r + 2) 2 = 0, 


so the general solution is 

y =(Cj + c 2 x)e 2x + (c 3 + c 4 x)e~ 2x . 


Example 3. The equation 

y<«-2y" + 2y'-2y' + y = 0 


has auxiliary equation 


,4 _ 2r 3 + 2r 2 - 2r +1 = 0, 


or, after factoring, 12 


(r - l) 2 (r 2 +l) = 0. 

The general solution is therefore 

y=(c 1 + c 2 x)e x +c 3 cos x + c 4 sin x. 


Example 4. Coupled harmonic oscillators. Linear equations of order 
n > 2 arise most often in physics by eliminating variables from simultane¬ 
ous systems of second order equations. We can see an example of this by 
linking together two simple harmonic oscillators of the kind discussed 
at the beginning of Section 20. Accordingly, let two carts of masses w, 
and m 2 be attached to the left and right walls in Figure 32 by springs with 
spring constants k 4 and k 2 . If there is no damping and these carts are left 
unconnected, then when disturbed each moves with its own simple har¬ 
monic motion, that is, we have two independent harmonic oscillators. 
We obtain coupled harmonic oscillators if we now connect the carts to each 
other by a spring with spring constant k 3 , as indicated in the figure. By 
applying Newton's second law of motion, it can be shown (Problem 16) 


12 To factor the auxiliary equation, notice that r = 1 is a root that can be found by inspection, 
so r - 1 is a factor of the auxiliary polynomial and the other factor can be found by long 
division. 



Second Order Linear Equations 


159 


h 




^3 

m 2 





nn 




i * 1 1 


x2 


FIGURE 32 


that the displacements x 1 and x 2 of the carts satisfy the following simul¬ 
taneous system of second order linear equations: 


mi = -hxi + k 3 (x 2 - Xi) 
= — (fci + k 3 )xi + k 3 x 3/ 
m 2 = -k 3 (x 2 -x^- k 2 x 2 
= k 3 xi-(k 2 +k 3 )x 2 . 


( 6 ) 


We can now obtain a single fourth order equation for x 3 by solving the 
first equation for x 2 and substituting in the second equation (Problem 17). 

We have not yet addressed the problem of finding a particular solution for 
the complete equation (1). In this context it suffices to remark that the method 
of undetermined coefficients discussed in Section 18 continues to apply, with 
obvious minor changes, for functions/(x) of the types considered in that sec¬ 
tion. In the next section we shall examine a totally different approach to the 
problem of finding particular solutions. 


Example 5. Find a particular solution of the differential equation 
y'" + 2y" - y' = 3x 2 - 2x + 1. 

Our experience in Section 18 suggests that we take a trial solution of 
the form 


y = x(a 0 + fljX + a 2 x 2 ) 

= fl 0 X + fl 1 X 2 + fl 2 X 3 . 

Since y' =a 0 + 2aiX + 3a 2 x 2 , y" = 2a 3 + 6a 2 x, and y"' = 6a 2 , substitution in the 
given equation yields 

6« 2 +2(2flj + 6a 2 x) - (a Q + 2a 3 x + 3 a 2 x 2 ) = 3x 2 - 2x +1 
or, after collecting coefficients of like powers of x, 

-3 a 2 x 2 + + 12n 2 )x + (- a 0 + 4^! + 6fl 2 ) = 3x 2 - 2x +1. 
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Thus, 


— 3a 2 =3, 

— 2a ^ +12fl 2 = — 2, 

-a 0 + 4 flj + 6fl 2 = 1, 

so a 2 = -l, a 1 = - 5, and a g =-27. We therefore have a particular solution 
y = -27x - 5x 2 - x 3 . 


Problems 

Find the general solution of each of the following equations. 

1. y'" - 3y"+2y' = 0. 

2. y'" -3y"+4y'-2y = 0. 

3. y'" - y = 0. 

4. y"'+y = 0. 

5. y"' + 3y" + 3y'+y = 0. 

6. y (4) + 4y+ 6y" + 4y' + y = 0. 

7. y(4)_y = 0. 

8. y (4, + 5y" +4y = 0. 

9. y (4) - 2fl 2 y" + tz 4 y = 0. 

10. y (4) + 2fl 2 y"+« 4 y = 0. 

11. y< 4 > + 2y"' + 2y" + 2y' + y = 0. 

12. y (4) + 2y'" - 2y" - 6y' + 5y = 0. 

13. y'" -6y" + lly'-6y = 0. 

14. y (4) + y"' - 3y" - 5y' - 2y = 0. 

15. y< 5 ) - 6y« - 8y"' +48y'' + 16y' - 96y = 0. 

16. Derive equations (6) for the coupled harmonic oscillators by using the 
configuration shown in Figure 32, where both carts are displaced to the 
right from their equilibrium positions and x 2 > x v so that the spring on 
the right is compressed and the other two are stretched. 

17. In Example 4, find the fourth order differential equation for x 1 by elimi¬ 
nating x 2 as suggested. 

18. In the preceding problem, solve the fourth order equation for x, if the 
masses are equal and the spring constants are equal, so that m 1 = m 2 = m 
and k 1 =k 2 =k 3 =k. In this special case, show directly (that is, with¬ 
out using the symmetry of the situation) that x 2 satisfies the same 
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differential equation as x v The two frequencies associated with these 
coupled harmonic oscillators are called the normal frequencies of the sys¬ 
tem. What are they? 

19. Find the general solution of y (4) = 0. Of i/ 4) = sin x + 24. 

20. Find the general solution of i/"' - 3y" + 2y' = 10 + 42e 3r . 

21. Find the solution of y"'-y' = l that satisfies the initial conditions 
y(0) = y'(0)=y"(0)=4. 

22. Show that the change of independent variable x-e z transforms the 
third order Euler equidimensional equation 

x 3 y+ a x x 2 y" + a 2 xy' + a 3 y = 0 


into a third order linear equation with constant coefficients. (This trans¬ 
formation also works for the nth order Euler equation.) Solve the fol¬ 
lowing equations by this method: 

(a) x 3 y"' + 3x 2 y" = 0; 

(b) x 3 y"' + x 2 y" - 2xy' + 2y = 0; 

(c) x 3 y'" + 2x 2 y" + xy' -y = 0. 

23. In determining the drag on a small sphere moving at a constant speed 
through a viscous fluid, it is necessary to solve the differential equation 

x 3 y (4) + 8x 2 y"' + 8xy" - 8y' = 0. 


Notice that this is an Euler equation for y' and use the method of 
Problem 22 to show that the general solution is 

y = <qx 2 + c 2 x _1 + c 3 x -3 + c 4 . 

These ideas are part of the mathematical background used by Robert A. 
Millikan in his famous oil-drop experiment of 1909 for measuring the 
charge on an electron, for which he won the 1923 Nobel Prize. 13 


23 Operator Methods for Finding Particular Solutions 

At the end of Section 22 we referred to the problem of finding particular 
solutions for nonhomogeneous equations of the form 


dflf 

dx n 


+ Cl\ 


JH-1„ , 

d _ y_ 

dx n ~ l 


H-1- (Z„_ 1 


dy 

dx 


+ a n y = /(x). 


(1) 


13 For a clear explanation of this exceedingly ingenious experiment, with a good drawing of 
the apparatus, see pp. 50-51 of the book by Linus Pauling mentioned in Section 4 [Note 12]. 
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In this section we give a very brief sketch of the use of differential opera¬ 
tors for solving this problem in more efficient ways than any we have seen 
before. These "operational methods" are mainly due to the English applied 
mathematician Oliver Heaviside (1850-1925). Heaviside's methods seemed 
so strange to the scientists of his time that he was widely regarded as a 
crackpot, which unfortunately is a common fate for thinkers of unusual 
originality. 

Let us represent derivatives by powers of D, so that 


dy 2 dry 
ax ax 


...,D"y 


d n y 

dx n ‘ 


Then (1) can be written as 

D"y + + • • • + + a„y = f(x), (2) 


or as 


(D" + fli D n 1 + • • • + + a„ )y = f(x), 


or as 


p(D)y=f(x), (3) 

where the differential operator p(D) is simply the auxiliary polynomial p(r) 
with r replaced by D. The successive application of two or more such opera¬ 
tors can be made by first multiplying the operators together by the usual 
rules of algebra and then applying the product operator. For example, we 
know that p(D) can be formally factored into 


p(D) = (D-r 1 )(D-r 2 )-(D-r n ), (4) 

where r y r 2 ,..., r n are the roots of the auxiliary equation; and these factors can 
then be applied successively in any order to yield the same result as a single 
application of p(D). As an illustration of this idea, we point out that if the 
auxiliary equation is of the second decree and therefore has only two roots r 1 
and r 2 , then formally we have 

(D - rf(D - r 2 ) -D 2 - (r, + r 2 )D + (5) 


( D-r 2 )y 



dy 

dx 


-by. 


and since 
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we can verify (5) by writing 


(D-r 1 )(D-r 2 )y = ^-r 1 j^-r 2 i/j 


d f dy 
dxydx 



-n 



= ^A-(ti+r 2 )‘j- + r 1 r 2 y 
ax ax 

= D 2 y - (r-i + r 2 )Dy + pr 2 y 
= [D 2 -(r 1 + r 2 )D + r 1 r 2 ]y, 


for this is the meaning of (5). 

We have no difficulty with the meaning of the expression p(D)y on the left 
of (3): it has the same meaning as the left side of (2) or (1). Our purpose now is 
to learn how to treat p(D) as a separate entity, and in doing this to develop the 
methods for solving (3) that are the subject of this section. Without beating 
around the bush, we wish to "solve formally" for y in (3), obtaining 


y = 



( 6 ) 


Here 1 /p(D) represents an operation to be performed on f(x) to yield y. The 
question is, what is the nature of this operation, and how can we carry it out? 
In order to begin to understand these matters, we consider the simple equa¬ 
tion Dy =f(x), which gives 


y = ^f(x). 

But Dy=f(x), or equivalently dy/dx=f(x), is easily solved by writing 

y = J f(x)dx, 

so it is natural to make the definition 

^m=j/(x)dx. ( 7 ) 

This tells us that the operator 1/D applied to a function means integrate the 
function. Similarly, the operator 1/D 2 applied to a function means integrate 
the function twice in succession, and so on. Operators like 1/D and 1/D 2 



164 


Differential Equations with Applications and Historical Notes 


are called inverse operators. We continue this line of investigation and exam¬ 
ine other inverse operators. Consider 

(D - r)y =f(x), (8) 

where r is a constant. Formally, we have 

y = -pp—/(*)• 

D-r 

But (8) is the simple first order linear equation 

ry=f(x ), 
dx 


whose solution by Section 10 is 


y = e w j"e rx f( x ) dx. 


(We suppress constants of integration because we are only seeking particu¬ 
lar solutions.) It is therefore natural to make the definition 

- i —f(x) = e rx \e- rx f( X )dx. (9) 

D-r J 

Notice that this reduces to (7) when r = 0. We are now ready to begin carrying 
out the problem-solving procedures that arise from (6). 


METHOD 1: SUCCESSIVE INTEGRATIONS. By using the factorization 
(4), we can write formula (6) as 

y = —-— f(x) =--- f(x) 

J p(D) JK (D-n)(D-r 2 )---(D-r n ) J 

11 1 

= — --—/(*). 

D-r\D-r 2 D-r n J 


Here we may apply the n inverse operators in any convenient order, and by 
(9) we know that the complete process requires n successive integrations. 
That the resulting function y = y(x) is a particular solution of (3) is easily seen; 
for by applying to y the factors of p(D) in suitable order, we undo the succes¬ 
sive integrations and arrive back at f(x). 
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Example 1. Find a particular solution of y" - 3y' + 2y = xe x . 
Solution. We have (D 2 - 3D + 2)y=xe x , so 

(D - 1) (D - 2) 1 / = xe x and y = —---— xe'. 

“ J D-1D-2 

By (9) and an integration by parts, we obtain 

^ ^ xe x = e 2x Je~ 2x xe x dx = -(1 + x)e x , 
so 

V = [-(! + x)e*] = -e z je~ x (l + x)e x dx = -|(1 + x)V. 


Example 2. Find a particular solution of y" - y=e _x . 
Solution. We have (D 2 - l)y =e~ x , so 

(D - 1)(D + l)y = e~ x , y = —-— <T X , 

D-1D + 1 


-- 1 —e” x = e“ x feV x dx = xe~ x , 

D + l J 


y = -- 1 —xe“ x = e x fe- x xe“ x dx =[-^x- i V* 

v D-l J l 2 4i 


METHOD 2: PARTIAL FRACTIONS DECOMPOSITIONS OF 
OPERATORS. The successive integrations of method 1 are likely to become 
complicated and time-consuming to carry out. The formula 


y = 


1 

m 


/w= 


i 

(D-r 1 )(D-r 2 )---(D-r„) 


/(*) 


suggests a way to avoid this work, for it suggests the possibility of decom¬ 
posing the operator on the right into partial fractions. If the factors of p(D) 
are distinct, we can write 


y = 


p(D) 


f(x) = 


A? 


Ai 

D-p D-r 2 


- + ••• + - 


D-r„ 


fix) 
















166 


Differential Equations with Applications and Historical Notes 


for suitable constants A lr A 2 ,..., A n , and each term on the right can be found 
by using (9). The operator in brackets here is sometimes called the Heaviside 
expansion of the inverse operator l/p(D). 


Example 3. Solve the problem in Example 1 by this method. 
Solution. We have 


y = 


1 


(D 1)(D — 2) 


D-2 D-l 


1 x 1 
- xe - xe 


D-2 D-l 
= e 2x je~ 2x xe x dx - e x je~ x xe x dx 
1 


= -l(l+x)e x -^x 2 e x =~(l + x + jx 2 )e x . 


The student will notice that this solution is not quite the same as the 
solution found in Example 1. However, it is easy to see that they differ 
only by a solution of the reduced homogeneous equation, so all is well. 


Example 4. Solve the problem in Example 2 by this method. 
Solution. We have 


(D-l)(D + l) f 2\_D-1 D + l 
= -^e x je~ x e~ x dx - ^e~ x je x e~ x dx 



1 

2 


xe 


-X 


If some of the factors of p(D) are repeated, then we know that the form of 
the partial fractions decomposition is different. For example if D - r l is a 
/c-fold repeated factor, then the decomposition contains the terms 

A\ | A 2 | | At 

D-n (D-n) 2 (D-nf 


These operators can be applied to f(x) in order from left to right, each requir¬ 
ing an integration based on the result of the preceding step, as in method 1. 

METHOD 3: SERIES EXPANSIONS OF OPERATORS. For problems in 
which/(x) is a polynomial, it is often useful to expand the inverse operator 
l/p(D) in a power series in D, so that 
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y = 


1 

p(D) 


f(x) = (1 + + b 2 D 2 + ■ • •)/(*)• 


The reason for this is that high derivatives of polynomials disappear, because 
D k x n = 0 iik>n. 


Example 5. Find a particular solution of y'" - 2y" + y = x 4 +2x +5. 
Solution. We have (D 3 - 2D 2 + l)y= x 4 +2x + 5, so 


y = 


1- 2D 2 + D 


(x 4 + 2x + 5). 


By ordinary long division we find that 

-\ r = 1 + 2D 2 -D 3 +4D 4 -4D 5 +---, 

1-2 D 2 + D 3 

so 

y = (1 + 2D 2 - D 3 + 4D 4 -4D 5 + ■ • -)(x 4 + 2x + 5) 
= (x 4 +2x + 5) + 2(12x 2 ) - (24x) + 4(24) 

= x 4 + 24x 2 - 22x +101. 


In order to make the fullest use of this method, it is desirable to keep in mind 
the following series expansions from elementary algebra: 

1 1 

-= 1 + r + r 2 + r 3 + ••• and -= 1 -r + r 2 - r 3 + ■■■. 

1 -r 1+r 

In this context we are only interested in these formulas as "formal" series 
expansions, and have no need to concern ourselves with their convergence 
behavior. 


Example 6. Find a particular solution of y+y" + y' + y = x 5 - 2x 2 +x. 
Solution. We have (D 3 + D 2 +D + l) y=x 5 - 2x 2 + x, so 


y =-=-^(x 3 - 2x 3 + x) 

3 1 + D + D 2 +D 3 


1-D 


(1-D)(x 5 -2x 2 + x) 


- 2l ' 2 +x) ~ (51 ' 4 “ 4x + 1} ] 

= (1 + D 4 + D 8 + • • -)[x 5 - 5x 4 - 2x 2 + 5x -1] 
=(x 3 - 5x 4 - 2x 2 + 5x -1) + (120x -120) 

= x 3 - 5x 4 - 2x 2 + 125x -121. 
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The remarkable thing about the procedures illustrated in these examples is 
that they actually work! 


METHOD 4: THE EXPONENTIAL SHIFT RULE. As we know, exponen¬ 
tial functions behave in a special way under differentiation. This fact enables 
us to simplify our work whenever/(x) contains a factor of the form e kx . Thus, 
if f(x) = e kx g(x), we begin by noticing that 

(D - r)f(x) = (D - r)e kx g(x) 

= e kx Dg(x) + ke kx g(x) - re kx g(x) 

= e kx (D + k - r)g(x). 

By applying this formula to the successive factors D - r v D -r 2/ D - r n , we see 
that for the polynomial operator p(D), 

p(D)e kx g(x) = e kx p(D + k)g(x). (10) 


This says that we can move the factor e kx to the left of the operator p(D) if we 
replace D by D + k in the operator. 

The same property is valid for the inverse operator l/p(D), that is. 


1 

P(D) 


e hc g(x) = e kx 


1 _ 

p(D + k) 


£(*)• 


( 11 ) 


To see this, we simply apply p(D) to the right side and use (10): 

1 , , „ 1 


p(D)e k 


P(D~k ) 


g(x) = e p(D + k) 


p(D + k) 


g(x) = e g(x). 


Properties (10) and (11) are called the exponential shift rule. They are useful in 
moving exponential functions out of the way of operators. 


Example 7. Solve the problem in Example 1 by this method. 
Solution. We have (D 2 - 3D + 2)y = xe x , so 


y = 


D 2 - 3D + 2 
1 


-xe = e 


(D +1) 2 - 3(D +1) + 2 


* 1 1 

= e' —=- x = -e - x 

D 2 -D Dl-D 


= -e*| -^- + 1 + 0 +D 2 + ■•■ \x 


= -e x \ ^x 2 + x + l |, 


as we have already seen in Examples 1 and 3. 
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Interested readers will find additional material on the methods of this section 
in the "Historical Introduction" to H. S. Carslaw and J. C. Jaeger, Operational 
Methods In Applied Mathematics, Dover, New York, 1963; and in E. Stephens, The 
Elementary Theory of Operational Mathematics, McGraw-Hill, New York, 1937. 


Problems 

1. Find a particular solution of y"-Ay-e 2x by using each of Methods 1 
and 2. 

2. Find a particular solution of y" - y = x * 2 e 2x by using each of Methods 1,2, 
and 4. 

In Problems 3 to 6, find a particular solution by using Method 1. 

3. y" + 4y' + Ay = 10x 3 e _2 *. 

4. y" - 2y' +y~e x . 

5- y" -y = er x . 

6. y" - 2y' -3 y- 6e 5x . 

In Problems 7 to 15, find a particular solution by using Method 3. 

7. y" - y' + y = x 3 - 3x 2 + l. 

8. y'" - 2y' +y = 2x 3 - 3x 2 + 4x + 5. 

9. 4y" + y = x 4 . 

10. y< 5 > - y"' =x 2 . 

11. y (6) * * - y = x 10 . 

12. y" + y' - y = 3x - x 4 . 

13. y" + y = x 4 . 

14. y"' - y" = 12x - 2. 

15. y'" + y" = 9x 2 - 2x +1. 

In Problems 16 to 18, find a particular solution by using Method 4. 

16. y" - 4y' + 3y = x 3 e 2 *. 

17. y" - 7y' + 12y =e 2x (x 3 - 5x 2 ). 

18. y" + 2y'+y = 2x 2 e _2 * + 3e 2 *. 

In Problems 19 to 24, find a particular solution by any method. 

19. y'" - 8y = 16x 2 . 

20. y( 4 >-y=l -x 3 . 

21. y"' -Iy' = x. 

22. y< 4 > = x -3 . 
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23. y'" -y" + y'=x + l. 

24. y'" + 2y" = x. 

25. Use the exponential shift rule to find the general solution of each of the 
following equations: 

(a) (D - 2 ) 3 y = e 2x [hint: multiply by e~ 2x and use (10)]; 

(b) (D + l)hj=l2e- x ; 

(c) (D - 2) 2 y = e 2 *sin x. 

26. Consider the nth order homogeneous equation p(D)y = 0. 

(a) If a polynomial q(r) is a factor of the auxiliary polynomial p(r), show 
that any solution of the differential equation q(D)y = 0 is also a solu¬ 
tion of p(D)y = 0. 

(b) If r 1 is a root of multiplicity k of the auxiliary equation p(r) = 0, show 
that any solution of (D - rf-y = 0 is also a solution of p(D)y = 0. 

(c) Use the exponential shift rule to show that (D - r t ) k y = 0 has 

y = (ci + c 2 x + c 3 x 2 + • • ■ + c k x k - 1 )e nx 

as its general solution. Hint: (D - r f y = 0 is equivalent to 
e nx D k {e~ nx y) = 0. 


Appendix A. Euler 

Leonhard Euler (1707-1783) was Switzerland's foremost scientist and one 
of the three greatest mathematicians of modern times (the other two being 
Gauss and Riemann). 

He was perhaps the most prolific author of all time in any field. From 1727 
to 1783 his writings poured out in a seemingly endless flood, constantly add¬ 
ing knowledge to every known branch of pure and applied mathematics, 
and also to many that were not known until he created them. He averaged 
about 800 printed pages a year throughout his long life, and yet he almost 
always had something worthwhile to say and never seems long-winded. The 
publication of his complete works was started in 1911, and the end is not in 
sight. This edition was planned to include 887 titles in 72 volumes, but since 
that time extensive new deposits of previously unknown manuscripts have 
been unearthed, and it is now estimated that more than 100 large volumes 
will be required for completion of the project. Euler evidently wrote mathe¬ 
matics with the ease and fluency of a skilled speaker discoursing on subjects 
with which he is intimately familiar. His writings are models of relaxed clar¬ 
ity. He never condensed, and he reveled in the rich abundance of his ideas 
and the vast scope of his interests. The French physicist Arago, in speaking 
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of Euler's incomparable mathematical facility, remarked that "He calculated 
without apparent effort, as men breathe, or as eagles sustain themselves in 
the wind." He suffered total blindness during the last 17 years of his life, 
but with the aid of his powerful memory and fertile imagination, and with 
helpers to write his books and scientific papers from dictation, he actually 
increased his already prodigious output of work. 

Euler was a native of Basel and a student of John Bernoulli at the University, 
but he soon outstripped his teacher. His working life was spent as a member 
of the Academies of Science at Berlin and St. Petersburg, and most of his 
papers were published in the journals of these organizations. His business 
was mathematical research, and he knew his business. He was also a man of 
broad culture, well versed in the classical languages and literatures (he knew 
the Aeneid by heart), many modern languages, physiology, medicine, botany, 
geography, and the entire body of physical science as it was known in his 
time. However, he had little talent for metaphysics or disputation, and came 
out second best in many good-natured verbal encounters with Voltaire at the 
court of Frederick the Great. His personal life was as placid and uneventful 
as is possible for a man with 13 children. 

Though he was not himself a teacher, Euler has had a deeper influence on 
the teaching of mathematics than any other man. This came about chiefly 
through his three great treatises: Introductio in Analysin Infinitorum (1748); 
Institutiones Calcidi Differentialis (1755); and Institutiones Calcidi Integmlis 
(1768-1794). There is considerable truth in the old saying that all elementary 
and advanced calculus textbooks since 1748 are essentially copies of Euler 
or copies of copies of Euler. 14 These works summed up and codified the dis¬ 
coveries of his predecessors, and are full of Euler's own ideas. He extended 
and perfected plane and solid analytic geometry, introduced the analytic 
approach to trigonometry, and was responsible for the modern treatment 
of the functions log x and e x . He created a consistent theory of logarithms of 
negative and imaginary numbers, and discovered that log x has an infinite 
number of values. It was through his work that the symbols e, Jt, and i (= V-1) 
became common currency for all mathematicians, and it was he who linked 
them together in the astonishing relation e m = -1. This is merely a special 
case (put 0 = jt) of his famous formula e'° = cos 0 + / sin 0, which connects the 
exponential and trigonometric functions and is absolutely indispensable in 
higher analysis. 15 Among his other contributions to standard mathematical 


14 See C. B. Boyer, "The Foremost Textbook of Modern Times," Am. Math. Monthly, Vol. 58, 
pp. 223-226,1951. 

15 An even more astonishing consequence of his formula is the fact that an imaginary power of an 
imaginary number can be real, in particular i‘ = e~^ 2 ; for if we put 0 = it/2, we obtain e*‘ yz = i, so 

i‘ = (e” /2 )' = e” 2 / 2 = e~” /2 . 

Euler further showed that i' has infinitely many values, of which this calculation produces 
only one. 
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notation were sin x, cos x, the use of f(x) for an unspecified function, and the 
use of E for summation. 16 Good notations are important, but the ideas behind 
them are what really count, and in this respect Euler's fertility was almost 
beyond belief. He preferred concrete special problems to the general theories 
in vogue today, and his unique insight into the connections between appar¬ 
ently unrelated formulas blazed many trails into new areas of mathematics 
which he left for his successors to cultivate. 

He was the first and greatest master of infinite series, infinite products, 
and continued fractions, and his works are crammed with striking discover¬ 
ies in these fields. James Bernoulli (John's older brother) found the sums of 

several infinite series, but he was not able to find the sum of the reciprocals 
111 

of the squares, 1 + — + — + —+.... He wrote, "If someone should succeed in 
4 4 9 16 

finding this sum, and will tell me about it, I shall be much obliged to him." 
In 1736, long after James's death, Euler made the wonderful discovery that 


,111 n 2 

1 +—+-+— + ... = — 
4 9 16 6 


He also found the sums of the reciprocals of the fourth and sixth powers. 


,11 ,11 7 i 4 

1 H-v- -I-, -I-— 1 +-1-— H-—- 

2 4 3 4 16 81 90 


and 


l + - r +—r + --- = l + +-+ •••= -. 

2 6 3 6 64 729 945 

When John heard about these feats, he wrote, "If only my brother were alive 
now." 17 Few would believe that these formulas are related—as they are—to 
Wallis's infinite product (1656), 

n = 2 2 4 4 6 6 

2 “ lV3 V5 V”' 


Euler was the first to explain this in a satisfactory way, in terms of his infinite 
product expansion of the sine. 


sinx 

x 


„2 A 


1 -- 


„2 \ 


1- 


4n 


„ 2 \ 


1- 


9n 


16 See F. Cajori, A History of Mathematical Notations, Open Court, Chicago, 1929. 

17 The world is still waiting—more than 200 years later—for someone to discover the sum of 
the reciprocals of the cubes. 
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Wallis's product is also related to Brouncker's remarkable continued fraction. 


n 

4 


1 



which became understandable only in the context of Euler's extensive 
researches in this field. 

His work in all departments of analysis strongly influenced the further 
development of this subject through the next two centuries. He contributed 
many important ideas to differential equations, including substantial parts 
of the theory of second order linear equations and the method of solution by 
power series. He gave the first systematic discussion of the calculus of varia¬ 
tions, which he founded on his basic differential equation for a minimizing 
curve. He introduced the number now known as Euler's constant, 

y = lim (1 + — + — H-f — - log n ) = 0.5772..., 

»-»°°y 2 3 n J 

which is the most important special number in mathematics after n and e. He 
discovered the integral defining the gamma function. 


00 

r(x) = 

o 

which is often the first of the so-called higher transcendental functions that 
students meet beyond the level of calculus, and he developed many of its 
applications and special properties. He also worked with Fourier series, 
encountered the Bessel functions in his study of the vibrations of a stretched 
circular membrane, and applied Laplace transforms to solve differential 
equations—all before Fourier, Bessel, and Laplace were born. Even though 
Euler died about 200 years ago, he lives everywhere in analysis. 

E. T. Bell, the well-known historian of mathematics, observed that "One 
of the most remarkable features of Euler's universal genius was its equal 
strength in both of the main currents of mathematics, the continuous and the 
discrete." In the realm of the discrete, he was one of the originators of mod¬ 
ern number theory and made many far-reaching contributions to this subject 
throughout his life. In addition, the origins of topology—one of the dominant 
forces in modern mathematics—lie in his solution of the Konigsberg bridge 
problem and his formula V-E+F = 2 connecting the numbers of vertices. 
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edges, and faces of a simple polyhedron. In the following paragraphs, we 
briefly describe some of his activities in these fields. 

In number theory, Euler drew much of his inspiration from the challeng¬ 
ing marginal notes left by Fermat in his copy of the works of Diophantus. 
He gave the first published proofs of both Fermat's theorem and Fermat's 
two squares theorem. He later generalized the first of these classic results 
by introducing the Euler c|) function; his proof of the second cost him 
7 years of intermittent effort. In addition, he proved that every positive 
integer is a sum of four squares and investigated the law of quadratic 
reciprocity. 

Some of his most interesting work was connected with the sequence of 
prime numbers, that is, with those integers p > 1 whose only positive divisors 

1 1 

are 1 and p. His use of the divergence of the harmonic series 1 + — + — + - to 

prove Euclid's theorem that there are infinitely many primes is so simple and 
ingenious that we venture to give it here. Suppose that there are only N primes, 
say p v p 2/ ..., p N Then each integer n > 1 is uniquely expressible in the form 
n = p'l' pf ■■■ p‘f. If a is the largest of these exponents, then it is easy to see that 


1 + 


1 

—+ 
2 


1 

— + ••• + 
3 


1 

n 





f 1 1 

1 



' 1 1 

o 

l P 2 

+ ^- + - 

P2 



1 H-1- 2 —1“ ' 

^ Pn Pn 

+ Pn, 


by multiplying out the factors on the right. But the simple formula 
1 + x + x 2 + — = 1/(1 - x), which is valid for | x | <1, shows that the factors in the 
above product are less than the numbers 

1 _1 _1 

1-1/Pl 7 1-1/P2 ' '1-1//V 

SO 

l + l + l + ... + l< ft n 

2 3 n pi -1 p 2 -1 Pn- 1 

for every n. This contradicts the divergence of the harmonic series and shows 
that there cannot exist only a finite number of primes. He also proved that 
the series 


11111 1 1 
-1- 1 -1-1-1- 1 -h • • * 

2 3 5 7 11 13 17 
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of the reciprocals of the primes diverges, and discovered the following won¬ 
derful identity: if s > 1, then 



1 

1-1 If’ 


where the expression on the right denotes the product of the numbers 
(1 - p _s ) _1 for all primes p. We shall return to this identity later, in our note on 
Riemann in Appendix E in Chapter 5. 

He also initiated the theory of partitions, a little-known branch of number 
theory that turned out much later to have applications in statistical mechan¬ 
ics and the kinetic theory of gases. A typical problem of this subject is to 
determine the number p(n) of ways in which a given positive integer n can 
be expressed as a sum of positive integers, and if possible to discover some 
properties of this function. For example, 4 can be partitioned into 4 = 3 + 1 = 
2 + 2 = 2 + l + l = l + l + l + l, so p( 4) = 5, and similarly p(5) = 7 and p(6) = 11. It is 
clear that p(n) increases very rapidly with n, so rapidly, in fact, that 18 

p(200) = 3,972,999,029,388. 

Euler began his investigations by noticing (only geniuses notice such things) 
that p(n) is the coefficient of x" when the function [(1 - x)(l - x 2 )(l - x 3 ) —] _1 is 
expanded in a power series: 

--——-3— = 1 + p( l)x + p{ 2)x 2 + p(3)x 3 +.... 

(1 — x)(l — X )(l-x )... 


By building on this foundation, he derived many other remarkable identities 
related to a variety of problems about partitions. 19 


18 This evaluation required a month's work by a skilled computer in 1918. His motive was to 
check an approximate formula for p{n), namely 


p(n) 


1 

4j iS 




(the error was extremely small). 

19 See Chapter XIX of G. H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, 
Oxford University Press, 1938; or Chapters 12-14 of G. E. Andrews, Number Theory, W. B. 
Saunders, San Francisco, 1971. These treatments are "elementary" in the technical sense that 
they do not use the high-powered machinery of advanced analysis, but nevertheless they are 
far from simple. For students who wish to experience some of Euler's most interesting work 
in number theory at first hand, and in a context not requiring much previous knowledge, 
we recommend Chapter VI of G. Polya's fine book. Induction and Analogy in Mathematics, 
Princeton University Press, 1954. 
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FIGURE 33 

The Konigsberg bridges. 


The Konigsberg bridge problem originated as a pastime of Sunday stroll¬ 
ers in the town of Konigsberg (now Kaliningrad) in what was formerly East 
Prussia. There were seven bridges across the river that flows through the 
town (see Figure 33). The residents used to enjoy walking from one bank to 
the islands and then to the other bank and back again, and the conviction 
was widely held that it is impossible to do this by crossing all seven bridges 
without crossing any bridge more than once. Euler analyzed the problem by 
examining the schematic diagram given on the right in the figure, in which 
the land areas are represented by points and the bridges by lines connecting 
these points. The points are called vertices, and a vertex is said to be odd or 
even according as the number of lines leading to it is odd or even. In modern 
terminology, the entire configuration is called a graph, and a path through 
the graph that traverses every line but no line more than once is called an 
Eider path. An Euler path need not end at the vertex where it began, but if it 
does, it is called an Euler circuit. By the use of combinatorial reasoning, Euler 
arrived at the following theorems about any such graph: (1) there are an even 
number of odd vertices; (2) if there are no odd vertices, there is an Euler cir¬ 
cuit starting at any point; (3) if there are two odd vertices, there is no Euler 
circuit, but there is an Euler path starting at one odd vertex and ending at the 
other; (4) if there are more than two odd vertices, there are no Euler paths. 20 
The graph of the Konigsberg bridges has four odd vertices, and therefore, by 
the last theorem, has no Euler paths. 21 The branch of mathematics that has 
developed from these ideas is known as graph theory ; it has applications to 
chemical bonding, economics, psychosociology, the properties of networks 
of roads and railroads, and other subjects. 

A polyhedron is a solid whose surface consists of a number of polygonal 
faces, and a regular polyhedron has faces that are regular polygons. As we 
know, there exists a regular polygon with n sides for each positive integer 


20 Euler's original paper of 1736 is interesting to read and easy to understand; it can be 
found on pp. 573-580 of J. R. Newman (ed). The World of Mathematics, Simon and Schuster, 
New York, 1956. 

21 It is easy to see—without appealing to any theorems— that this graph contains no Euler 
circuit, for if there were such a circuit, it would have to enter each vertex as many times as 
it leaves it, and therefore every vertex would have to be even. Similar reasoning shows also 
that if there were an Euler path that is not a circuit, there would be two odd vertices. 
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FIGURE 34 

Regular polyhedra. 


n = 3,4, 5,..and they even have special names—equilateral triangle, square, 
regular pentagon, etc. However, it is a curious fact—and has been known 
since the time of the ancient Greeks—that there are only five regular polyhe¬ 
dra, those shown in Figure 34, with names given in the table below. 

The Greeks studied these figures assiduously, but it remained for Euler to 
discover the simplest of their common properties: If V, E and F denote the num¬ 
bers of vertices, edges, and faces of any one of them, then in every case we have 

V-E + F = 2. 

This fact is known as Euler's formula for polyhedra, and it is easy to verify from 
the data summarized in the following table. 


Tetrahedron 

Cube 

Octahedron 

Dodecahedron 

Icosahedron 


V 

E 

F 

4 

6 

4 

8 

12 

6 

6 

12 

8 

20 

30 

12 

12 

30 

20 


This formula is also valid for any irregular polyhedron as long as it is simple — 
which means that it has no "holes" in it, so that its surface can be deformed 
continuously into the surface of a sphere. Figure 35 shows two simple irregu¬ 
lar polyhedra for which V - E + F = 6 - 10 + 6 = 2 and V - E + F = 6 - 9 + 5 = 2. 
However, Euler's formula must be extended to 


V - E + F = 2 - 2p 

















178 


Differential Equations with Applications and Historical Notes 




FIGURE 35 



FIGURE 36 

in the case of a polyhedron with p holes (a simple polyhedron is one for 
which p = 0). Figure 36 illustrates the cases p = 1 and p = 2; here we have 
V-E +F = 16-32 +16-0 whenp = l, and V-E + F = 24 - 44 + 18 = -2whenp = 2. 
The significance of these ideas can best be understood by imagining a poly¬ 
hedron to be a hollow figure with a surface made of thin rubber, and inflating 
it until it becomes smooth. We no longer have flat faces and straight edges, 
but instead a map on the surface consisting of curved regions, their boundar¬ 
ies, and points where boundaries meet. The number V-E + F has the same 
value for all maps on our surface, and is called the Eider characteristic of this 
surface. The number p is called the genus of the surface. These two numbers, 
and the relation between them given by the equation V-E + F = 2-2p, are evi¬ 
dently unchanged when the surface is continuously deformed by stretching 
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or bending. Intrinsic geometric properties of this kind—which have little 
connection with the type of geometry concerned with lengths, angles, and 
areas—are called topological. The serious study of such topological properties 
has greatly increased during the past century, and has furnished valuable 
insights to many branches of mathematics and science. 22 

The distinction between pure and applied mathematics did not exist in 
Euler's day, and for him the entire physical universe was a convenient object 
whose diverse phenomena offered scope for his methods of analysis. The 
foundations of classical mechanics had been laid down by Newton, but Euler 
was the principal architect. In his treatise of 1736 he was the first to explicitly 
introduce the concept of a mass-point or particle, and he was also the first to 
study the acceleration of a particle moving along any curve and to use the 
notion of a vector in connection with velocity and acceleration. His continued 
successes in mathematical physics were so numerous, and his influence was 
so pervasive, that most of his discoveries are not credited to him at all and are 
taken for granted by physicists as part of the natural order of things. However, 
we do have Euler's equations of motion for the rotation of a rigid body, Euler's 
hydrodynamic equation for the flow of an ideal incompressible fluid, Euler's 
law for the bending of elastic beams, and Euler's critical load in the theory 
of the buckling of columns. On several occasions the thread of his scientific 
thought led him to ideas his contemporaries were not ready to assimilate. 
For example, he foresaw the phenomenon of radiation pressure, which is cru¬ 
cial for the modern theory of the stability of stars, more than a century before 
Maxwell rediscovered it in his own work on electromagnetism. 

Euler was the Shakespeare of mathematics—universal, richly detailed, 
and inexhaustible. 23 


Appendix B. Newton 

Most people are acquainted in some degree with the name and reputation 
of Isaac Newton (1642-1727), for his universal fame as the discoverer of the 
law of gravitation has continued undiminished over the two and a half cen¬ 
turies since his death. It is less well known, however, that in the immense 
sweep of his vast achievements he virtually created modern physical science, 
and in consequence has had a deeper influence on the direction of civilized 
life than the rise and fall of nations. Those in a position to judge have been 


22 Proofs of Euler's formula and its extension are given on pp. 236-240 and 256-259 of R. Courant 
and H. Robbins, What Is Mathematics?, Oxford University Press, 1941. See also G. Polya, op. 
cit., pp. 35-43. 

23 For further information, see C. Truesdell, "Leonhard Euler, Supreme Geometer (1707-1783)," 
in Studies in Eighteenth-Century Culture, Case Western Reserve University Press, 1972. Also, 
the November 1983 issue of Mathematics Magazine is wholly devoted to Euler and his work. 
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unanimous in considering him one of the very few supreme intellects that 
the human race has produced. 

Newton was born to a farm family in the village of Woolsthorpe in north¬ 
ern England. Little is known of his early years, and his undergraduate life 
at Cambridge seems to have been outwardly undistinguished. In 1665 an 
outbreak of the plague caused the universities to close, and Newton returned 
to his home in the country, where he remained until 1667. There, in 2 years of 
rustic solitude—from age 22 to 24—his creative genius burst forth in a flood 
of discoveries unmatched in the history of human thought: the binomial 
series for negative and fractional exponents; differential and integral calcu¬ 
lus; universal gravitation as the key to the mechanism of the solar system; 
and the resolution of sunlight into the visual spectrum by means of a prism, 
with its implications for understanding the colors of the rainbow and the 
nature of light in general. In his old age he reminisced as follows about this 
miraculous period of his youth: "In those days I was in the prime of my age 
for invention and minded Mathematicks and Philosophy [i.e., science] more 
than at any time since." 24 

Newton was always an inward and secretive man, and for the most part 
kept his monumental discoveries to himself. He had no itch to publish, and 
most of his great works had to be dragged out of him by the cajolery and per¬ 
sistence of his friends. Nevertheless, his unique ability was so evident to his 
teacher, Isaac Barrow, that in 1669 Barrow resigned his professorship in favor 
of his pupil (an unheard-of event in academic life), and Newton settled down 
at Cambridge for the next 27 years. His mathematical discoveries were never 
really published in connected form; they became known in a limited way 
almost by accident, through conversations and replies to questions put to 
him in correspondence. He seems to have regarded his mathematics mainly 
as a fruitful tool for the study of scientific problems, and of comparatively 
little interest in itself. Meanwhile, Leibniz in Germany had also invented cal¬ 
culus independently; and by his active correspondence with the Bernoullis 
and the later work of Euler, leadership in the new analysis passed to the 
Continent, where it remained for 200 years. 25 

Not much is known about Newton's life at Cambridge in the early years 
of his professorship, but it is certain that optics and the construction of 
telescopes were among his main interests. He experimented with many 


24 The full text of this autobiographical statement (probably written sometime in the period 
1714-1720) is given on pp. 291-292 of I. Bernard Cohen, Introduction to Newton's 'Principia,' 
Harvard University Press, 1971. The present writer owns a photograph of the original 
document. 

25 It is interesting to read Newton's correspondence with Leibniz (via Oldenburg) in 1676 and 
1677 (see The Correspondence of Isaac Newton, Cambridge University Press, 1959-1976, 6 vol¬ 
umes so far). In Items 165, 172, 188, and 209, Newton discusses his binomial series but con¬ 
ceals in anagrams his ideas about calculus and differential equations, while Leibniz freely 
reveals his own version of calculus. Item 190 is also of considerable interest, for in it Newton 
records what is probably the earliest statement and proof of the Fundamental Theorem of 
Calculus. 
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techniques for grinding lenses (using tools which he made himself), and 
about 1670 built the first reflecting telescope, the earliest ancestor of the great 
instruments in use today at Mount Palomar and throughout the world. The 
pertinence and simplicity of his prismatic analysis of sunlight have always 
marked this early work as one of the timeless classics of experimental sci¬ 
ence. But this was only the beginning, for he went further and further in 
penetrating the mysteries of light, and all his efforts in this direction con¬ 
tinued to display experimental genius of the highest order. He published 
some of his discoveries, but they were greeted with such contentious stu¬ 
pidity by the leading scientists of the day that he retired back into his shell 
with a strengthened resolve to work thereafter for his own satisfaction alone. 
Twenty years later he unburdened himself to Leibniz in the following words: 
"As for the phenomena of colours.. . I conceive myself to have discovered the 
surest explanation, but I refrain from publishing books for fear that disputes 
and controversies may be raised against me by ignoramuses." 26 

In the late 1670s Newton lapsed into one of his periodic fits of distaste 
for science, and directed his energies into other channels. As yet he had 
published nothing about dynamics or gravity, and the many discoveries he 
had already made in these areas lay unheeded in his desk. At last, however, 
under the skillful prodding of the astronomer Edmund Halley (of Halley's 
Comet), he turned his mind once again to these problems and began to write 
his greatest work, the Principia. 27 

It all seems to have started in 1684 with three men in deep conversation in 
a London inn—Halley, and his friends Christopher Wren and Robert Hooke. 
By thinking about Kepler's third law of planetary motion, Halley had come 
to the conclusion that the attractive gravitational force holding the planets in 
their orbits was probably inversely proportional to the square of the distance 
from the sun. 28 However, he was unable to do anything more with the idea 
than formulate it as a conjecture. As he later wrote (in 1686): 

I met with Sir Christopher Wren and Mr. Hooke, and falling in discourse 
about it, Mr. Hooke affirmed that upon that principle all the Laws of 
the celestiall motions were to be demonstrated, and that he himself had 


26 Correspondence, Item 427. 

27 The full title is Philosophiae Naturalis Principia Mathematica (Mathematical Principles of Natural 
Philosophy). 

28 At that time this was quite easy to prove under the simplifying assumption—which contra¬ 
dicts Kepler's other two laws—that each planet moves with constant speed v in a circular 
orbit of radius r. [Proof: In 1673 Huygens had shown, in effect, that the acceleration a of such 
a planet is given by a = v 2 /r. If T is the periodic time, then 

(2tt rjTf in 2 r 3 

a ~ r ~ r 2 ’ j2 ' 

By Kepler's third law, T 2 is proportional to r 3 , so r 3 /T 2 is constant, and a is therefore inversely 
proportional to r 2 . If we now suppose that the attractive force F is proportional to the accel¬ 
eration, then it follows that F is also inversely proportional to r 2 .] 
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done it. I declared the ill success of my attempts; and Sir Christopher, 
to encourage the Inquiry, said that he would give Mr. Hooke or me 
two months' time to bring him a convincing demonstration therof, and 
besides the honour, he of us that did it, should have from him a pres¬ 
ent of a book of 40 shillings. Mr. Hooke then said that he had it, but 
that he would conceale it for some time, that others triing and failing, 
might know how to value it, when he should make it publick; however, 

I remember Sir Christopher was little satisfied that he could do it, and 
tho Mr. Hooke then promised to show it him, I do not yet find that in that 
particular he has been as good as his word. 29 

It seems clear that Halley and Wren considered Hooke's assertions to be 
merely empty boasts. A few months later Halley found an opportunity to 
visit Newton in Cambridge, and put the question to him: "What would be 
the curve described by the planets on the supposition that gravity dimin¬ 
ishes as the square of the distance?" Newton answered immediately, "An 
ellipse." Struck with joy and amazement, Halley asked him how he knew 
that. "Why," said Newton, "I have calculated it." Not guessed, or surmised, 
or conjectured, but calcidated. Halley wanted to see the calculations at once, 
but Newton was unable to find the papers. It is interesting to speculate on 
Halley's emotions when he realized that the age-old problem of how the 
solar system works had at last been solved—but that the solver hadn't both¬ 
ered to tell anybody and had even lost his notes. Newton promised to write 
out the theorems and proofs again and send them to Halley, which he did. 
In the course of fulfilling his promise he rekindled his own interest in the 
subject, and went on, and greatly broadened the scope of his researches. 30 

In his scientific efforts Newton somewhat resembled a live volcano, with 
long periods of quiescence punctuated from time to time by massive erup¬ 
tions of almost superhuman activity. The Principia was written in 18 incred¬ 
ible months of total concentration, and when it was published in 1687 it was 
immediately recognized as one of the supreme achievements of the human 
mind. It is still universally considered to be the greatest contribution to sci¬ 
ence ever made by one man. In it he laid down the basic principles of theo¬ 
retical mechanics and fluid dynamics; gave the first mathematical treatment 
of wave motion; deduced Kepler's laws from the inverse square law of gravi¬ 
tation, and explained the orbits of comets; calculated the masses of the earth, 
the sun, and the planets with satellites; accounted for the flattened shape 
of the earth, and used this to explain the precession of the equinoxes; and 
founded the theory of tides. These are only a few of the splendors of this pro¬ 
digious work. 31 The Principia has always been a difficult book to read, for the 


29 Correspondence, Item 289. 

30 For additional details and the sources of our information about these events, see Cohen, op. 
cit., pp. 47-54. 

31 A valuable outline of the contents of the Principia is given in Chapter VI of W. W. Rouse Ball, 
An Essay on Newton's Principia (first published in 1893; reprinted in 1972 by Johnson Reprint 
Corp, New York). 
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style has an inhuman quality of icy remoteness, which perhaps is appropriate 
to the grandeur of the theme. Also, the densely packed mathematics consists 
almost entirely of classical geometry, which was little cultivated then and 
is less so now. 32 In his dynamics and celestial mechanics, Newton achieved 
the victory for which Copernicus, Kepler, and Galileo had prepared the way. 
This victory was so complete that the work of the greatest scientists in these 
fields over the next two centuries amounted to little more than footnotes to 
his colossal synthesis. It is also worth remembering in this context that the 
science of spectroscopy, which more than any other has been responsible for 
extending astronomical knowledge beyond the solar system to the universe 
at large, had its origin in Newton's spectral analysis of sunlight. 

After the mighty surge of genius that went into the creation of the Principia, 
Newton again turned away from science. However, in a famous letter to 
Bentley in 1692, he offered the first solid speculations on how the universe of 
stars might have developed out of a primordial featureless cloud of cosmic 
dust: 


It seems to me, that if the matter of our Sun and Planets and all the mat¬ 
ter in the Universe was evenly scattered throughout all the heavens, and 
every particle has an innate gravity towards all the rest ... some of it 
would convene into one mass and some into another, so as to make an 
infinite number of great masses scattered at great distances from one to 
another throughout all that infinite space. And thus might the Sun and 
Fixt stars be formed, supposing the matter were of a lucid nature. 33 

This was the beginning of scientific cosmology, and later led, through the 
ideas of Thomas Wright, Kant, Herschel, and their successors, to the elabo¬ 
rate and convincing theory of the nature and origin of the universe provided 
by late twentieth century astronomy. 

In 1693 Newton suffered a severe mental illness accompanied by delu¬ 
sions, deep melancholy, and fears of persecution. He complained that he 
could not sleep, and said that he lacked his "former consistency of mind." He 
lashed out with wild accusations in shocking letters to his friends Samuel 
Pepys and John Locke. Pepys was informed that their friendship was over 
and that Newton would see him no more; Locke was charged with trying to 
entangle him with women and with being a "Hobbist" (a follower of Hobbes, 
i.e., an atheist and materialist). 34 Both men feared for Newton's sanity. They 
responded with careful concern and wise humanity, and the crisis passed. 


32 The nineteenth century British philosopher Whewell has a vivid remark about this: "Nobody 
since Newton has been able to use geometrical methods to the same extent for the like pur¬ 
poses; and as we read the Principia we feel as when we are in an ancient armoury where the 
weapons are of gigantic size; and as we look at them we marvel what manner of man he was 
who could use as a weapon what we can scarcely lift as a burden." 

33 Correspondence, Item 398. 

34 Correspondence, Items 420, 421, and 426. 
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In 1696 Newton left Cambridge for London to become Warden (and soon 
Master) of the Mint, and during the remainder of his long life he entered a 
little into society and even began to enjoy his unique position at the pinnacle 
of scientific fame. These changes in his interests and surroundings did not 
reflect any decrease in his unrivaled intellectual powers. For example, late 
one afternoon, at the end of a hard day at the Mint, he learned of a now- 
famous problem that the Swiss scientist John Bernoulli had posed as a chal¬ 
lenge "to the most acute mathematicians of the entire world." The problem 
can be stated as follows: Suppose two nails are driven at random into a wall, 
and let the upper nail be connected to the lower by a wire in the shape of a 
smooth curve. What is the shape of the wire down which a bead will slide 
(without friction) under the influence of gravity so as to pass from the upper 
nail to the lower in the least possible time? This is Bernoulli's brachistochrone 
("shortest time") problem. Newton recognized it at once as a challenge to him¬ 
self from the Continental mathematicians; and in spite of being out of the 
habit of scientific thought, he summoned his resources and solved it that 
evening before going to bed. His solution was published anonymously, and 
when Bernoulli saw it, he wryly remarked, "I recognize the lion by his claw." 

Of much greater significance for science was the publication of his Opticks 
in 1704. In this book he drew together and extended his early work on 
light and color. As an appendix he added his famous Queries, or specula¬ 
tions on areas of science that lay beyond his grasp in the future. In part the 
Queries relate to his lifelong preoccupation with chemistry (or alchemy, 
as it was then called). He formed many tentative but exceedingly careful 
conclusions—always founded on experiment—about the probable nature of 
matter; and though the testing of his speculations about atoms (and even 
nuclei) had to await the refined experimental work of the late nineteenth and 
early twentieth centuries, he has been proven absolutely correct in the main 
outlines of his ideas. 35 So, in this field of science too, in the prodigious reach 
and accuracy of his scientific imagination, he passed far beyond not only 
his contemporaries but also many generations of his successors. In addition, 
we quote two astonishing remarks from Queries 1 and 30, respectively: "Do 
Not Bodies act upon Light at a distance, and by their action bend its Rays?" 
and "Are not gross Bodies and Light convertible into one another?" It seems 
as clear as words can be that Newton is here conjecturing the gravitational 
bending of light and the equivalence of mass and energy, which are prime 
consequences of the theory of relativity. The former phenomenon was first 
observed during the total solar eclipse of May 1919, and the latter is now 
known to underlie the energy generated by the sun and the stars. On other 
occasions as well he seems to have known, in some mysterious intuitive way, 
far more than he was ever willing or able to justify, as in this cryptic sentence 
in a letter to a friend: "It's plain to me by the fountain I draw it from, though I 


35 See S. I. Vavilov, "Newton and the Atomic Theory," in Newton Tercentenary Celebrations, 
Cambridge University Press, 1947. 
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will not undertake to prove it to others." 36 Whatever the nature of this "foun¬ 
tain" may have been, it undoubtedly depended on his extraordinary powers 
of concentration. When asked how he made his discoveries, he said, "I keep 
the subject constantly before me and wait till the first dawnings open little 
by little into the full light." This sounds simple enough, but everyone with 
experience in science or mathematics knows how very difficult it is to hold 
a problem continuously in mind for more than a few seconds or a few min¬ 
utes. One's attention flags; the problem repeatedly slips away and repeatedly 
has to be dragged back by an effort of will. From the accounts of witnesses, 
Newton seems to have been capable of almost effortless sustained concentra¬ 
tion on his problems for hours and days and weeks, with even the need for 
occasional food and sleep scarcely interrupting the steady squeezing grip of 
his mind. 

In 1695 Newton received a letter from his Oxford mathematical friend John 
Wallis, containing news that cast a cloud over the rest of his life. Writing 
about Newton's early mathematical discoveries, Wallis warned him that 
in Flolland "your Notions" are known as "Leibniz's Calculus Differentialis," 
and he urged Newton to take steps to protect his reputation. 37 At that time 
the relations between Newton and Leibniz were still cordial and mutually 
respectful. However, Wallis's letters soon curdled the atmosphere, and initi¬ 
ated the most prolonged, bitter, and damaging of all scientific quarrels: the 
famous (or infamous) Newton-Leibniz priority controversy over the inven¬ 
tion of calculus. 

It is now well established that each man developed his own form of cal¬ 
culus independently of the other, that Newton was first by 8 or 10 years but 
did not publish his ideas, and that Leibniz's papers of 1684 and 1686 were 
the earliest publications on the subject. However, what are now perceived as 
simple facts were not nearly so clear at the time. There were ominous minor 
rumblings for years after Wallis's letters, as the storm gathered: 

What began as mild innuendoes rapidly escalated into blunt charges 
of plagiarism on both sides. Egged on by followers anxious to win a 
reputation under his auspices, Newton allowed himself to be drawn 
into the centre of the fray; and, once his temper was aroused by accu¬ 
sations of dishonesty, his anger was beyond constraint. Leibniz's con¬ 
duct of the controversy was not pleasant, and yet it paled beside that 
of Newton. Although he never appeared in public, Newton wrote most 
of the pieces that appeared in his defense, publishing them under the 
names of his young men, who never demurred. As president of the Royal 
Society, he appointed an "impartial" committee to investigate the issue, 
secretly wrote the report officially published by the society [in 1712], and 
reviewed it anonymously in the Philosophical Transactions. Even Leibniz's 
death could not allay Newton's wrath, and he continued to pursue the 


36 Correspondence, Item 193. 

37 Correspondence, Items 498 and 503. 
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enemy beyond the grave. The battle with Leibniz, the irrepressible 
need to efface the charge of dishonesty, dominated the final 25 years of 
Newton's life. Almost any paper on any subject from those years is apt to 
be interrupted by a furious paragraph against the German philosopher, 
as he honed the instruments of his fury ever more keenly. 38 

All this was bad enough, but the disastrous effect of the controversy on 
British science and mathematics was much more serious. It became a mat¬ 
ter of patriotic loyalty for the British to use Newton's geometrical methods 
and clumsy calculus notations, and to look down their noses at the upstart 
work being done on the Continent. However, Leibniz's analytical methods 
proved to be far more fruitful and effective, and it was his followers who 
were the moving spirits in the richest period of development in mathemati¬ 
cal history. What has been called "the Great Sulk" continued; for the British, 
the work of the Bernoullis, Euler, Lagrange, Laplace, Gauss, and Riemann 
remained a closed book; and British mathematics sank into a coma of impo¬ 
tence and irrelevancy that lasted through most of the eighteenth and nine¬ 
teenth centuries. 

Newton has often been thought of and described as the ultimate rational¬ 
ist, the embodiment of the Age of Reason. His conventional image is that of 
a worthy but dull absent-minded professor in a foolish powdered wig. But 
nothing could be further from the truth. This is not the place to discuss or 
attempt to analyze his psychotic flaming rages; or his monstrous vengeful 
hatreds that were unquenched by the death of his enemies and continued at 
full strength to the end of his own life; or the 58 sins he listed in the private 
confession he wrote in 1662; or his secretiveness and shrinking insecurity; 
or his peculiar relations with women, especially with his mother, who he 
thought had abandoned him at the age of 3. And what are we to make of 
the bushels of unpublished manuscripts (millions of words and thousands 
of hours of thought!) that reflect his secret lifelong studies of ancient chro¬ 
nology, early Christian doctrine, and the prophecies of Daniel and St. John? 
Newton's desire to know had little in common with the smug rationalism 
of the eighteenth century; on the contrary, it was a form of desperate self- 
preservation against the dark forces that he felt pressing in around him. 39 As 
an original thinker in science and mathematics he was a stupendous genius 
whose impact on the world can be seen by everyone; but as a man he was so 
strange in every way that normal people can scarcely begin to understand 
him. It is perhaps most accurate to think of him in medieval terms—as a con¬ 
secrated, solitary, intuitive mystic for whom science and mathematics were 
means of reading the riddle of the universe. 


38 Richard S. Westfall, in the Encyclopaedia Britannica. 

39 The best effort is Frank E. Manuel's excellent book, A Portrait of Isaac Newton, Harvard 
University Press, 1968. 
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24 Oscillations and the Sturm Separation Theorem 

It is natural to feel that a differential equation should be solved, and one 
of the main aims of our work in Chapter 3 was to develop ways of finding 
explicit solutions of the second order linear equation 

y" + P(x)y' + Q(x)y = 0. (1) 

Unfortunately, however—as we have tried to emphasize—it is rarely pos¬ 
sible to solve this equation in terms of familiar elementary functions. This 
situation leads us to seek wider vistas by formulating the problem at a higher 
level, and to recognize that our real goal is to understand the nature and 
properties of the solutions of (1). If this goal can be attained by means of 
elementary formulas for these solutions, well and good. If not, then we try to 
open up other paths to the same destination. In this brief chapter we turn our 
attention to the problem of learning what we can about the essential charac¬ 
teristics of the solutions of (1) by direct analysis of the equation itself, in the 
absence of formal expressions for these solutions. It is surprising how much 
interesting and useful information can be gained in this way. 

As an illustration of the idea that many properties of the solutions of a dif¬ 
ferential equation can be discovered by studying the equation itself, without 
solving it in any traditional sense, we discuss the familiar equation 


y"+y= o. (2) 

We know perfectly well that y 2 (x) = sin x and y 2 (x) = cos x are two linearly 
independent solutions of (2); that they are fully determined by the initial con¬ 
ditions y^O) = 0, yi(0) = 1 and y 2 (0) = 1, y' 2 ( 0) = 0; and that the general solution 
is y(x) = e,y,(x) + c 2 y 2 (x). Normally we regard (2) as completely solved by these 
observations, for the functions sin x and cos x are old friends and we know 
a great deal about them. However, our knowledge of sin x and cos x can be 
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FIGURE 37 

thought of as an accident of history; and for the sake of emphasizing our 
present point of view, we now pretend total ignorance of these familiar func¬ 
tions. Our purpose is to see how their properties can be squeezed out of (2) 
and the initial conditions they satisfy The only tools we shall use are quali¬ 
tative arguments and the general principles described in Sections 14 and 15. 

Accordingly, let y = s(x) be defined as the solution of (2) determined by the 
initial conditions s(0) = 0 and s'(0) = 1. If we try to sketch the graph of s(x) by 
letting x increase from 0, the initial conditions tell us to start the curve at the 
origin and let it rise with slope beginning at 1 (Figure 37). From the equa¬ 
tion itself we have s"(x) = -s(x), so when the curve is above the x-axis, s"(x) is 
a negative number that increases in magnitude as the curve rises. Since s"(x) 
is the rate of change of the slope s'(x), this slope decreases at an increasing 
rate as the curve lifts, and it must reach 0 at some point x = m. As x continues 
to increase, the curve falls toward the x-axis, s'(x) decreases at a decreasing 
rate, and the curve crosses the x-axis at a point we can define to be it. Since 
s"(x) depends only on s(x), we see that the graph between x = 0 and x = it is 
symmetric about the line x = m, so m = it/2 and s'(it) = -l. A similar argument 
shows that the next portion of the curve is an inverted replica of the first 
arch, and so on indefinitely. 

In order to make further progress, it is convenient at this stage to introduce 
y = c(x) as the solution of (2) determined by the initial conditions c(0) = 1 and 
c'(0) = 0. These conditions tell us (Figure 37) that the graph of c(x) starts at the 
point (0,1) and moves to the right with slope beginning at 0. since by equa¬ 
tion (2) we know that c"(x) = -c(x), the same reasoning as before shows that 
the curve bends down and crosses the x-axis. It is natural to conjecture that 
the height of the first arch of s(x) is 1, that the first zero of c(x) is it/2, etc.; but 
to establish these guesses as facts, we begin by showing that 

s'(x) = c(x) and c'(x) = -s(x). (3) 

To prove the first statement, we start by observing that (2) yields y'"+ y' - 0 
or (y')" + y' - 0, so the derivative of any solution of (2) is again a solution (see 
Problem 17-4). Thus s'(x) and c(x) are both solutions of (2), and by Theorem 14-A 
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it suffices to show that they have the same values and the same derivatives 
at x = 0. This follows at once from s'(0) = 1, c(0) = 1 and s"(0) = -s(0) = 0, c'(0) = 0. 
The second formula in (3) is an immediate consequence of the first, for 
c'(x) = s"(x) = -s{x). We now use (3) to prove 

s(x) 2 + c(x) 2 = l. (4) 

Since the derivative of the left side of (4) is 

2s(x)c(x) - 2c(x)s(x), 

which is 0, we see that s(x) 2 + c(x) 2 equals a constant, and this constant must be 
1 because s(0) 2 + c(0) 2 = 1. It follows at once from (4) that the height of the first 
arch of s(x) is 1 and that the first zero of c(x) is it/2. This result also enables 
us to show that s(x) and c(x) are linearly independent, for their Wronskian is 

W[s(x), c(x)] = s(x)c'(x) - c(x)s'(x) 

— —s(x) 2 - c(x) 2 = -l. 

In much the same way, we can continue and establish the following addi¬ 
tional facts: 


s(x + a) - s(x)c(a) + c(x)s(a); 

(5) 

c(x + a)- c(x)c(a) - s(x)s(a); 

(6) 

s(2x) = 2s(x)c(x); 

(7) 

c(2x) = c(x) 2 - s(x) 2 ; 

(8) 

s(x + 2ji)=s(x); 

(9) 

c(x + 2k) - c(x). 

(10) 


The proofs are not difficult, and we leave them to the reader (see Problem 1). 
Among other things, it is easy to see from the above results that the posi¬ 
tive zeros of s(x) and c(x) are, respectively, it, 2it, 3it, . . . and ji/ 2, it/2 + ir, 
ji/2 + 2n,.... 

There are two main points to be made about the above discussion. First, we 
have extracted almost every significant property of the functions sin x and 
cos x from equation (2) by the methods of differential equations alone, without 
using any prior knowledge of trigonometry Second, the tools we did use 
consisted chiefly of convexity arguments (involving the sign and magnitude 
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of the second derivative) and the basic properties of linear equations set forth 
in Sections 14 and 15. 

It goes without saying that most of the above properties of sin x and cos x 
are peculiar to these functions alone. Nevertheless, the central feature of 
their behavior—the fact that they oscillate in such a manner that their zeros 
are distinct and occur alternately—can be generalized far beyond these par¬ 
ticular functions. The following result in this direction is called the Sturm 
separation theorem. 1 

Theorem A. Ifyfx) and y 2 (x) are two linearly independent solutions of 

y" + P(x)y' + Q(x)y = 0, 

then the zeros of these functions are distinct and occur alternately—in the sense that 
yfx) vanishes exactly once between any two successive zeros ofy 2 (x), and conversely. 

Proof. The argument rests primarily on the fact (see the lemmas in Section 15) 
that since y v and y 2 are linearly independent, their Wronskian 

W(j h,yi) = yi{x)yz{x) - y 2 {x)y\(x) 

does not vanish, and therefore—since it is continuous—must have constant 
sign. First, it is easy to see that y v and y 2 cannot have a common zero; for 
if they do, then the Wronskian will vanish at that point, which is impos¬ 
sible. We now assume that x 1 and x 2 are successive zeros of y 2 and show 
that y 1 vanishes between these points. The Wronskian clearly reduces to 
yfx)y' 2 (x) at x x and x 2 , so both factors yfx) and t/ 2 (x) are # 0 at each of these 
points. Furthermore, y 2 {xf) and i/ 2 (x 2 ) must have opposite signs, because if 
y 2 is increasing at x, it must be decreasing at x 2 , and vice versa. Since the 
Wronskian has constant sign, yfx^} and y 1 (x 2 ) must also have opposite signs, 
and therefore, by continuity, y,(x) must vanish at some point between x 1 
and x 2 . Note that y 1 cannot vanish more than once between x, and x 2 ; for if 
it does, then the same argument shows that y 2 must vanish between these 
zeros of y v which contradicts the original assumption that x, and x 2 are suc¬ 
cessive zeros of y 2 . 

The convexity arguments given above in connection with the equation 
y" + y = 0 make it clear that in discussing the oscillation of solutions it is 


1 Jacques Charles Francois Sturm (1803-1855) was a Swiss mathematician who spent most of 
his life in Paris. For a time he was tutor to the de Broglie family, and after holding several 
other positions he at last succeeded Poisson in the Chair of Mechanics at the Sorbonne. His 
main work was done in what is now called the Sturm-Liouville theory of differential equa¬ 
tions, which has been of steadily increasing importance ever since in both pure mathematics 
and mathematical physics. 
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convenient to deal with equations in which the first derivative term is miss¬ 
ing. We now show that any equation of the form 

y" + P(x)y' + Q(x)y = 0 (11) 


can be written as 


u" + q(x)u = 0 (12) 

by a simple change of the dependent variable. It is customary to refer to (11) 
as the standard form, and to (12) as the normal form, of a homogeneous second 
order linear equation. To write (11) in normal form, we put y(x) = u{x)v(x), so 
that y' = uv' + u'v and y" = uv" + 2 u'v' + u"v. When these expressions are substi¬ 
tuted in (11), we obtain 


vu" + (2v' + Pv)u' + (v" + Pv' + Qv)u = 0. (13) 

On setting the coefficient of u' equal to zero and solving, we find that 



(14) 


reduces (13) to the normal form (12) with 


q(x) = Q(x)-^P(x) 2 -^P'(x). 


(15) 


Since v(x) as given by (14) never vanishes, the above transformation of (11) 
into (12) has no effect whatever on the zeros of solutions, and therefore leaves 
unaltered the oscillation phenomena which are the objects of our present 
interest. 

We next show that if q(x) in (12) is a negative function, then the solutions of 
this equation do not oscillate at all. 


Theorem B. Ifq(x) < 0, and ifu(x) is a nontrivial solution ofu"+q{x)u-0, then u(x) 
has at most one zero. 

Proof. Let x 0 be a zero of u(x), so that u(x 0 ) = 0. Since u(x) is nontrivial (i.e., 
is not identically zero). Theorem 14-A implies that u'(x 0 ) * 0. For the sake of 
concreteness, we now assume that u'(x 0 ) > 0, so that u(x) is positive over some 
interval to the right of x 0 . Since q(x) < 0, u"(x) = -q(x)u(x) is a positive function 
on the same interval. This implies that the slope u'(x) is an increasing func¬ 
tion, so u(x) cannot have a zero to the right of x 0 , and in the same way it has 
none to the left of x 0 . A similar argument holds when u'(x 0 ) < 0, so u(x) has 
either no zeros at all or only one, and the proof is complete. 
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Since our interest is in the oscillation of solutions, this result leads us 
to confine our study of (12) to the special case in which q(x) is a positive 
function. 

Even in this case, however, it is not necessarily true that solutions will 
oscillate. To get an idea of what is involved, let u(x) be a nontrivial solution 
of (12) with q(x) > 0. If we consider a portion of the graph above the x-axis 
(Figure 38), then ii"(x)=-q(x)u(x) is negative, so the graph is concave down 
and the slope u'(x) is decreasing. If this slope ever becomes negative, then the 
curve plainly crosses the x-axis somewhere to the right and we get a zero for 
u(x). We know that this happens when q(x) is constant. The alternative is that 
although i('(x) decreases, it never reaches zero and the curve continues to rise, 
as in the upper part of Figure 38. It is reasonably clear from these remarks 
that u(x) will have zeros as x increases whenever q(x) does not decrease too 
rapidly. This leads us to the next theorem. 

Theorem C. Let u(x) be any nontrivial solution of u" + q(x)u = 0, where q(x) > Ofor 
all x > 0 .If 


00 



(16) 


then u(x) has infinitely many zeros on the positive x-axis. 

Proof. Assume the contrary, namely, that i;(x) vanishes at most a finite num¬ 
ber of times for 0 < x < so that a point x 0 > 1 exists with the property that 
u(x) * 0 for all x > x 0 . We may clearly suppose, without any loss of generality. 
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that u(x) > 0 for all x > x 0 , since u(x) can be replaced by its negative if neces¬ 
sary. Our purpose is to contradict the assumption by showing that u'(x) is 
negative somewhere to the right of x 0 —for, by the above remarks, this will 
imply that u(x) has a zero to the right of x () . If we put 


v(x) = - 


u'(x) 

u(x) 


for x > x 0 , then a simple calculation shows that 

v'(x) = q(x) + v(x) * 1 2 3 4 ; 

and on integrating this from x 0 to x, where x > x 0 , we get 


X X 

v(x)-v(x 0 ) = ^q(x)dx + jv(x) 2 dx. 


We now use (16) to conclude that v(x) is positive if x is taken large enough. 
This shows that u(x) and u'(x) have opposite signs if x is sufficiently large, so 
u'(x) is negative and the proof is complete. 


Problems 

1. Prove formulas (5) to (10) by arguments consistent with the spirit of the 
preceding discussion. 

2. Show that the zeros of the functions a sin x + b cos x and c sin x + d cos x 
are distinct and occur alternately whenever ad - be * 0. 

3. Find the normal form of Bessel's equation 

x 2 y" + xy' + (x 2 - p 2 )y = 0, 

and use it to show that every nontrivial solution has infinitely many 
positive zeros. 

4. The hypothesis of Theorem C is false for the Euler equation 
y" + (k/x 2 )y = 0, but the conclusion is sometimes true and sometimes 
false, depending on the magnitude of the positive constant k. Show 
that every nontrivial solution has an infinite number of positive zeros 
if k > 1/4, and only a finite number if k < 1/4. 
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25 The Sturm Comparison Theorem 

In this section we continue our study of the oscillation behavior of nontrivial 
solutions of the differential equation 

y" + q(x)y = 0, (1) 

where q(x) is a positive function. We begin with a theorem that rules out the 
possibility of infinitely many oscillations on closed intervals. 


Theorem A. Let y(x) he a nontrivial solution of equation (1) on a closed interval 
[a, b]. Then y(x) has at most a finite number of zeros in this interval. 

Proof. We assume the contrary, namely, that y(x) has an infinite number of 
zeros in [a,b]. It follows from this that there exist in [a,b] a point x 0 and a 
sequence of zeros x„ # x 0 such that x n -> x 0 . 2 Since y(x) is continuous and dif¬ 
ferentiable at Xq, we have 


y(x 0 )= lim y(x„) = 0 

X n — 


and 


y'(x 0 )= lim 

X n —>*0 


y(x„)-y(x 0 ) 

X n Xq 


=0 . 


By Theorem 14-A, these statements imply that y(x) is the trivial solution of (1), 
and this contradiction completes the proof. 


We now recall that the Sturm separation theorem tells us that the zeros of any 
two (nontrivial) solutions of (1) either coincide or occur alternately, depend¬ 
ing on whether these solutions are linearly dependent or independent. Thus, 
all solutions of (1) oscillate with essentially the same rapidity, in the sense 
that on a given interval the number of zeros of any solution cannot differ by 
more than one from the number of zeros of any other solution. On the other 
hand, it is clear that solutions of 


y"+4y = 0 (2) 

oscillate more rapidly—that is, have more zeros—than solutions of 

y"+y=0; (3) 


2 In this inference we use the Bolzano-Weierstrass theorem of advanced calculus, which 
expresses one of the basic topological properties of the real number system. 
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for the zeros of a solution of (2) such as y = sin 2x are only half as far apart 
as the zeros of a solution y = sin x of (3). The following result, which is 
known as the Sturm comparison theorem, shows that this behavior is typi¬ 
cal in the sense that the solutions of (1) oscillate more rapidly when q(x) is 
increased. 


Theorem B. Let y(x) and z(x) be nontrivial solutions of 

y" + q{x)y = Q 


and 


z" + r(x)z = 0, 


where q(x) and r(x) are positive functions such that q(x) > r(x). Then y{x) vanishes at 
least once between any tzvo successive zeros ofz(x). 

Proof. Let x, and x 2 be successive zeros of z(x), so that z(xf = z(x 2 ) = 0 and z(x) 
does not vanish on the open interval (x u x 2 ). We assume that y(x) does not 
vanish on (x v x 2 ), and prove the theorem by deducing a contradiction. It is 
clear that no loss of generality is involved in supposing that both y(x) and z(x) 
are positive on {x v x 2 ), for either function can be replaced by its negative if 
necessary. If we emphasize that the Wronskian 

W(y, z)=y(x)z'(x) - z(x)y'(x) 
is a function of x by writing it W(x), then 


dW(x) 

dx 


=yz"-z\f 


= y(-rz)-z(-qy) 


= (q~r)yz> 0 


on (Xf,x 2 ). We now integrate both sides of this inequality from x y to x 2 and 
obtain 

W(x 2 ) - W(xj) >0 or W(x 2 ) > W(x,). 

However, the Wronskian reduces to y(x)z'(x) at x l and x 2 , so 

W(x 2 ) > 0 and W(x 2 ) < 0, 


which is the desired contradiction. 
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It follows from this theorem that if we have q(x) > k * 1 2 3 > 0 in equation (1), 
then any solution must vanish between any two successive zeros of a solu¬ 
tion y(x) = sin k(x - x 0 ) of the equation y" + k 2 y = 0, and therefore must vanish 
in any interval of length n/k. For example, if we consider Bessel's equation 

x 2 y" + xy' + (x 2 - p 2 )y = 0 

in normal form 


u + 


1 + 


l-4p 

4x 2 


2 A 


u = 0 , 


and compare this with u" + u = 0, then we at once have the next theorem. 

Theorem C. Let y p (x) be a nontrivial solution of Bessel’s equation on the positive 
x-axis. If0<p< 1/2, then every interval of length k contains at least one zero of 
y p (x); ifp = 1/2, then the distance between successive zeros ofy p {x) is exactly n; and if 
p> 1/2, then every interval of length k contains at most one zero ofy p (x). 

Bessel's equation is of considerable importance in mathematical physics. The 
oscillation properties of its solutions expressed in Theorem C, and also in 
Problem 24-3 and Problem 1 below, are clearly of fundamental significance 
for understanding the nature of these solutions. In Chapter 8 we shall devote 
a good deal of effort to finding explicit solutions for Bessel's equation in 
terms of power series. However, these series solutions are awkward tools to 
try to use in studying oscillation properties, and it is a great convenience to 
be able to turn to qualitative reasoning of the kind discussed in this chapter. 


Problems 

1. Let x 1 and x 2 be successive positive zeros of a nontrivial solution y p (x) of 
Bessel's equation. 

(a) If 0 < p < 1/2, show that x 2 - x, is less than ir and approaches jt as x, -* 

(b) If p > 1/2, show that x 2 - x, is greater than ir and approaches Jt as x, °°. 

2. If i/(x) is a nontrivial solution of y" + q(x)y = 0, show that y(x) has an infi¬ 
nite number of positive zeros if q(x) > k/x 2 for some k > 1/4, and only a 
finite number if q(x) < l/4x 2 . 

3. Every nontrivial solution of y" + (sin 2 x + \)y = 0 has an infinite number 
of positive zeros. Formulate and prove a theorem that includes this 
statement as a special case. 







Chapter 5 

Power Series Solutions and Special Functions 


26 Introduction. A Review of Power Series 

Most of the specific functions encountered in elementary analysis belong to 
a class known as the elementary functions. In order to describe this class, we 
begin by recalling that an algebraic function is a polynomial, a rational func¬ 
tion, or more generally any function y =f(x) that satisfies an equation of the 
form 


P n (x)y n +Pn-iWy"- 1 +■■■+ Pi(x)y +p 0 (*)=o, 

where each P,(x) is a polynomial. The elementary functions consist of the 
algebraic functions; the elementary transcendental (or nonalgebraic) functions 
occurring in calculus—i.e., the trigonometric, inverse trigonometric, expo¬ 
nential, and logarithmic functions; and all others that can be constructed 
from these by adding, subtracting, multiplying, dividing, or forming a func¬ 
tion of a function. Thus, 


y = tan 


xe 1/x +tan ^l + x 2 ) 
sin x cos 2x - logx 


is an elementary function. 

Beyond the elementary functions lie the higher transcendental functions, or, 
as they are often called, the special functions. Since the beginning of the eigh¬ 
teenth century, many hundreds of special functions have been considered 
sufficiently interesting or important to merit some degree of study. Most of 
these are almost completely forgotten but some, such as the gamma function, 
the Riemann zeta function, the elliptic functions, and those that continue 
to be useful in mathematical physics, have generated extensive theories. 
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And among these, a few are so rich in meaning and influence that the mere 
history of any one of them would fill a large book. 1 

The field of special functions was cultivated with enthusiastic devotion 
by many of the greatest mathematicians of the eighteenth and nineteenth 
centuries—by Euler, Gauss, Abel, Jacobi, Weierstrass, Riemann, Hermite, 
and Poincare, among others. But tastes change with the times, and today 
most mathematicians prefer to study large classes of functions (continu¬ 
ous functions, integrable functions, etc.) instead of outstanding individuals. 
Nevertheless, there are still many who favor biography over sociology, and a 
balanced treatment of analysis cannot neglect either view. 

Special functions vary rather widely with respect to their origin, nature, 
and applications. However, one large group with a considerable degree of 
unity consists of those that arise as solutions of second order linear differen¬ 
tial equations. Many of these find applications in connection with the par¬ 
tial differential equations of mathematical physics. They are also important, 
through the theory of orthogonal expansions, as the main historical source 
of linear analysis, which has played a central role in shaping much of mod¬ 
ern pure mathematics. 

Let us try to understand in a general way how these functions arise. It will 
be recalled that if we wish to solve the simple equation 


y"+y= o, (i) 

then the familiar functions y = sin x and y = cos x are already available for 
this purpose from elementary calculus. The situation with respect to the 
equation 


xy"+y'+xy =0 (2) 

is quite different, for this equation cannot be solved in terms of elementary 
functions. As a matter of fact, there is no known type of second order lin¬ 
ear equation—apart from those with constant coefficients, and equations 
reducible to these by changes of the independent variable—which can be 
solved in terms of elementary functions. In Chapter 4 we found that cer¬ 
tain general properties of the solutions of such an equation can often be 
established without solving the equation at all. But if a particular equation 
of this kind seems important enough to demand some sort of explicit solu¬ 
tion, what can we do? The approach we develop in this chapter is to solve 
it in terms of power series and to use these series to define new special 
functions. We then investigate the properties of these functions by means 
of their series expansions. If we succeed in learning enough about them. 


1 The reader who wishes to form an impression of the extent of this part of analysis would 
do well to look through the three volumes of Higher Transcendental Functions, A Erdelyi (ed.), 
McGraw-Hill, New York, 1953-1955. 
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then they attain the status of "familiar functions" and can be used as tools 
for studying the problem that gave rise to the original differential equa¬ 
tion. Needless to say, this program is easier to describe than to carry out, 
and is worthwhile only in the case of functions with a variety of significant 
applications. 

It is clear from the above remarks that we will be using power series 
extensively throughout this chapter. We take it for granted that most read¬ 
ers are reasonably well acquainted with these series from an earlier course 
in calculus. Nevertheless, for the benefit of those whose familiarity with 
this topic may have faded slightly, we present a brief review of the main 
facts. 

A. An infinite series of the form 


oo 

a n x = Uq + U\X + a^x 1 " + • • • (3) 

n =0 


is called a power series in x. The series 


Jji„(x-Xo) n =a 0 + a 1 (x-x 0 )+a 2 (x-Xo) 2 + ■■■ ( 4 ) 

n= 0 


is a power series in x-x 0 , and is somewhat more general than (3). 
However, (4) can always be reduced to (3) by replacing x -x 0 by x — 
which is merely a translation of the coordinate system—so for the 
most part we shall confine our discussion to power series of the 
form (3). 

B. The series (3) is said to converge at a point x if the limit 


m 


lim V 

m—>oo ( d 


a n x n 


n =0 


exists, and in this case the sum of the series is the value of this limit. 
It is obvious that (3) always converges at the point x = 0. With respect 
to the arrangement of their points of convergence, all power series in 
x fall into one or another or three major categories. These are typi¬ 
fied by the following examples: 


oo 



n=0 


= l + x + 2\x 2 +3!x 3 +•••; 


( 5 ) 
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oo 


I 

n =0 


X 


n 


n\ 


1 + x + 




( 6 ) 


^x” =l + x + x 2 + x 3 + ■■■. (7) 

n=0 


The first of these series diverges (i.e., fails to converge) for all x ± 0; 
the second converges for all x; and the third converges for |x|<l 
and diverges for |x|> 1. Some power series in x behave like (5), and 
converge only for x = 0. These are of no interest to us. Some, like (6), 
converge for all x. These are the easiest to work with. All others are 
roughly similar to (7). This means that to each series of this kind 
there corresponds a positive real number R, called the radius of con¬ 
vergence, with the property that the series converges if |x|<_R and 
diverges if | x | > R [R = 1 in the case of (7)]. 

It is customary to put R equal to 0 when the series converges only 
for x = 0, and equal to °° when it converges for all x. This convention 
allows us to cover all possibilities in a single statement: each power 
series in x has a radius of convergence R, where 0 < R < °°, with the 
property that the series converges if | x | < R and diverges if | x | > R. 
It should be noted that if R = 0 then no x satisfies | x | < R, and if R = °° 
then no x satisfies | x | > R. 

In many important cases the value of R can be found as 
follows. Let 


oo 

U n — Uq 4 " U] + 1/2 + ’ * ’ 

n =0 


be a series of nonzero constants. We recall from elementary calculus 
that if the limit 


lim^ 

»->°° u n 


= L 


exists, then the ratio test asserts that the series converges if L < 1 and 
diverges if L > 1. In the case of our power series (3), this tells us that 
if each a n / 0, and if for a fixed point x / 0 we have 


lim 

n —>00 


fl-n+lX 


n +1 


a n x n 


= lim fl " +1 

n —>00 n 
u n 


x = L, 
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then (3) converges if L < 1 and diverges if L > 1. These considerations 
yield the formula 


R = lim 

n —>oo 


a 


n 


tln+l 


if this limit exists (we put R = °° if | a n /a n+1 | —>• °°). Regardless of whether 
this formula can be used or not, it is known that R always exists; 
and if R is finite and nonzero, then it determines an interval of con¬ 
vergence -R<x<R such that inside the interval the series converges 
and outside the interval it diverges. A power series may or may not 
converge at either endpoint of its interval of convergence. 

C. Suppose that (3) converges for |x|<R with R> 0, and denote its sum 
by f(x): 


f(x) = a n x n = fl 0 + a iX + a 2 x 2 + ■■■. (8) 

n =0 


Then f(x) is automatically continuous and has derivatives of all 
orders for |x|<R. Also, the series can be differentiated termwise in 
the sense that 


oo 

f'(x) = ^ na n x n ~ l = a x + 2a 2 x + 3 a 3 x 2 + ■■■, 

n =1 


f"(x) = ^ n(n -1 )a n x n 2 = la 2 + 3• 2 a 3 x + ■■■, 

n =2 


and so on, and each of the resulting series converges for |x|<R. 
These successive differentiated series yield the following basic for¬ 
mula linking the a n to fix) and its derivatives: 


_ / (n) ( 0 ) 

n\ 


(9) 


Furthermore, it is often useful to know that the series (8) can be 
integrated termwise provided the limits of integration lie inside the 
interval of convergence. 
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If we have a second power series in x that converges to a function 
g(x) for | x | < R, so that 

00 

g(x) = ^ b n x n = b 0 + b\X + b 2 x 2 +■■■, (10) 

n =0 

then (8) and (10) can be added or subtracted termwise: 

oo 

f(x)±g(x) = ^( a n ±b n )x n =(a 0 ±b 0 )+(a 1 ±b 1 )x + ---. 

n =0 


They can also be multiplied as if they were polynomials, in the sense 
that 


f(x)g(x) = 

n=0 


where c tl -a 0 b n + a l b n _ 1 + ■■■ +a n b 0 . 2 If it happens that both series con¬ 
verge to the same function, so that f(x)-g(x) for |x|< R, then for¬ 
mula (9) implies that they must have the same coefficients: a 0 -b 0 , 
a 1 -b 1/ .... In particular, if f(x)-0 for |x|<_R, then a 0 -0, flj = 0, .... 

D. Let f(x) be a continuous function that has derivatives of all orders for 
| x | < R with R > 0. Ca n fix) be represented by a power series? If we use 
(9) to define the a n , then it is natural to hope that the expansion 

f(x) = J x n = /(0) + f'(Q)x + ^ + • • • (11) 

n=0 n * 


will hold throughout the interval. This is often true, but unfortu¬ 
nately it is sometimes false. One way of investigating the validity of 
this expansion for a specific point x in the interval is to use Taylor’s 
formula-. 


2 It will be useful later to notice that c„ can be written in two equivalent forms: 
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/M = Z 


k= 0 




where the remainder R„(x) is given by 


R n (x) = 


/ ( ” +1) (x) T „ +1 

(n + 1 )! 


for some point x between 0 and x. To verify (11), it suffices to show 
that R„(x) ->• 0 as « —By means of this procedure, it is quite easy to 
obtain the following familiar expansions, which are valid for all x: 


go n 1 s 

V x 1 X~ X 

e => — = l + x + — + — + •••; 

^n\ 2! 3! 


n=0 


( 12 ) 


sinx = ^(-l) n 

n =0 


x 2n+1 

(2n + 1)! 




(13) 


cosx = ^(-1)" 

n =0 


(2 n)\ 




(14) 


If a specific convergent power series is given to us, how can we rec¬ 
ognize the function that is its sum? In general it is impossible to do 
this, for very few power series have sums that are familiar elemen¬ 
tary functions. 

E. A function/(x) with the property that a power series expansion of 
the form 


/(x) = y>„(x-x 0 y (15) 

n =0 


is valid in some neighborhood of the point x 0 is said to be analytic at 
x 0 . In this case the a n are necessarily given by 


a 


n 


f {n \x o) 

. / 


and (15) is called the Taylor series of/(x) at x 0 . Thus, (12), (13), and (14) 
tell us that e x , sin x, and cos x are analytic at x 0 = 0, and the given series 
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are the Taylor series of these functions at this point. Most questions 
about analyticity can be answered by means of the following facts: 

1. Polynomials and the functions e x , sin x, and cos x are analytic at all 
points. 

2. If/(x) and g(x) are analytic at x 0 , then/(x) +g(x),f(x)g(x), and f(x)/g(x) [if 
g(x 0 ) # 0] are also analytic at x 0 . 

3. If/(x) is analytic at x 0 and f ] (x) is a continuous inverse, then/ '(x) is 
analytic at/(x 0 ) if f'(x 0 ) / 0. 

4. If g(x) is analytic at x 0 and /(x) is analytic at y(x 0 ), then f{g(xj) is ana¬ 
lytic at x 0 . 

5. The sum of a power series is analytic at all points inside the interval 
of convergence. 

Some of these statements are quite easy to prove by elementary methods, but 
others are not. Generally speaking, the behavior of analytic functions can be 
fully understood only in the broader context of the theory of functions of a 
complex variable. 


Problems 

1. Use the ratio test to verify that R = 0,R-°°, and R = 1 for the series (5), (6), 
and (7). 

2. If p is not zero or a positive integer, show that the series 

y P(p-l)(p-2)---(p-n + V) x „ 

^ n\ 

n =1 

converges for | x | < 1 and diverges for | x | > 1. 

3 . Show that R = °° for the series on the right sides of expansions (13) 
and (14). 

4 . Use Taylor's formula to establish the validity of the expansions (12), (13), 
and (14) for all x. Hint: a"/n\ ->■ 0 for every constant a (why?). 

5. It is well known from elementary algebra that 

T 2 n 1-X" +1 

l + X + X +-” + X = - 


1 — X 


if x* 1. 
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Use this to show that the expansions 
1 


1-x 


= l + x + x 2 + x 3 + 


and 


1+x 


-i 2 3 

-=l-x+x -x + 


are valid for | x | < 1. Apply the latter to show that 


x 2 x 3 x 4 

log(l + x) = x- — + -— — + • 
6 2 3 4 


and 


757 
1 X X X 

tan x = x-+- h ■ 

3 5 7 


for | x | < 1. 

6. Use the first expansion given in Problem 5 to find the power series for 
1/(1-x) 2 

(a) by squaring; 

(b) by differentiating. 

7. (a) Show that the series for cos x, 

1 x 2 x 4 x 6 

J 1-2 1-2-3-4 1-2-3-4-5-6 

has the property that y"-~y, and is therefore a solution of equation (1). 
(b) Show that the series 

x 2 x 4 x 6 

y ~ 1 _ 2 ^ + 2 2 .4 2 ~ 2 2 • 4 2 • 6 2 + ”' 

converges for all x, and verify that it is a solution of equation (2). 
[Observe that this series can be obtained from the one in (a) by replac¬ 
ing each odd factor in the denominators by the next greater even num¬ 
ber. The sum of this series is a useful special function denoted by J 0 (x) 
and called the Bessel function of order 0; it will be studied in detail in 
Chapter 8.] 
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27 Series Solutions of First Order Equations 

We have repeatedly emphasized that many interesting and important dif¬ 
ferential equations cannot be solved by any of the methods discussed in ear¬ 
lier chapters, and also that solutions for equations of this kind can often be 
found in terms of power series. Our purpose in this section is to explain the 
procedure by showing how it works in the case of first order equations that 
are easy to solve by elementary methods. 

As our first example, we consider the equation 


y'=y- (!) 

We assume that this equation has a power series solution of the form 

y=a 0 + a l x + a 2 x 1 + ■■■ +a n x n + (2) 

that converges for | x\<R with R> 0; that is, we assume that (1) has a solution 
that is analytic at the origin. A power series can be differentiated term by 
term in its interval of convergence, so 

y'-a l + 2a 2 x + 3a 3 x 1 + ■ ■ ■ + {n + l)a n+1 x n + ■■■. (3) 

Since y'=y, the series (2) and (3) must have the same coefficients: 

®i — ®(v 2a 2 = flj, 3 a 3 — a 2 , {n + l)a„ +1 = a n ,.... 

These equations enable us to express each a n in terms of a 0 : 


— a 0 , 



fo_ 

2’ 



a o 

2A'" 


a,i — 


fo_ 

n\‘ 


When these coefficients are inserted in (2), we obtain our power series 
solution 


y = a o 


f x 2 ^3 x n \ 

1 + X + — + -+ ••• + — + • 

v 2! 3! n\ J 


(4) 


where no condition is imposed on a 0 . It is essential to understand that so far 
this solution is only tentative, because we have no guarantee that (1) actually 
has a power series solution of the form (2). The above argument shows only 
that if (1) has such a solution, then that solution must be (4). However, it fol¬ 
lows at once from the ratio test that the series in (4) converges for all x, so the 
term-by-term differentiation is valid and (4) really is a solution of (1). In this 
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case we can easily recognize the series in (4) as the power series expansion of 
e x , so (4) can be written as 


y=a 0 e x . 

Needless to say, we can get this solution directly from (1) by separating vari¬ 
ables and integrating. Nevertheless, it is important to realize that (4) would 
still be a perfectly respectable solution even if (1) were unsolvable by elemen¬ 
tary methods and the series in (4) could not be recognized as the expansion 
of a familiar function. 

This example suggests a useful method for obtaining the power series 
expansion of a given function: find the differential equation satisfied by the 
function, and then solve this equation by power series. 

As an illustration of this idea we consider the function 

y = (l+x)f I , (5) 

where p is an arbitrary constant. It is easy to see that (5) is the indicated par¬ 
ticular solution of the following differential equation: 

(1 +x)y'=py, y(0) = 1. (6) 

As before, we assume that (6) has a power series solution 

y-a 0 + a l x + a 2 x 1 + ■■■ +a n x"+ ■■■ (7) 

with positive radius of convergence. It follows from this that 

y' = ai + 2a 2 x + 3 a 3 x 2 +••• + (« + l)a n+ iX n + ■■■, 
xy' = aix + la 2 x 2 + --- + na, l x n +•••, 
py = pa 0 + pa t x + pa 2 x 2 + • • • + pa n x n +■■■. 


By equation (6), the sum of the first two series must equal the third, so equat¬ 
ing the coefficients of successive power of x gives 

a 7 — pa 0 , la 2 -t a 7 — pa 3 a^ ~t 2a 2 — pa 2f ..., 

(n + 1) a n+1 + na n = pa„, .... 

The initial condition in (6) implies that a 0 = 1, so 

7 _ _«i(P-1) _ HP- 1) 

2 2 


„ a 2 (p- 2) p(p-l)(p-2) 

«3 ---- 


3 


2-3 


/•••/ 
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„ _ p(p-l)(p-2)---(p-n + l) 

U n — / • •• ■ 

n\ 

With these coefficients, (7) becomes 
y=l 

y K 2! 3! 

, P(P-1)(P~2)-(P-W + 1) , (g) 

n\ 

To conclude that (8) actually is the desired solution, it suffices to observe that 
this series converges for |x|<l (see Problem 26-2). On comparing the two 
solutions (5) and (8), and using the fact that (6) has only one solution, we have 

(l + x) f ' = l + px+ ^ X 2 + -" 

pfr-DHp-x+l) 

n\ v ’ 

for | x | < 1. This expansion is called the binomial series, and generalizes the 
binomial theorem to the case of an arbitrary exponent. 3 


Problems 

1. Consider the following differential equations: 

(a) y' - 2xy; 

(b) y' + y = 1. 


3 As the reader will recall from elementary algebra, the binomial theorem states that if n is a 
positive integer, then 

(1 + xT =l + nx + n(n ~ 1) x 2 +- + n(n ~ 1) "; {n ~ k + 1) x k + - + x". 


2 ! 


k\ 


More concisely. 




k=0 [f J 


XT, 


where the binomial coefficient 



is defined by 

n! _ n(n-T)---(n-k + l) 
kl(n-k)l 


k 


k\ 
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In each case, find a power series solution of the form a n x", try to 
recognize the resulting series as the expansion of a familiar function, 
and verify your conclusion by solving the equation directly. 

2. Consider the following differential equations: 

(a) xij'=y; 

(b) x 2 y'=y. 

In each case, find a power series solution of the form ' S ^ a„x", solve the 
equation directly, and explain any discrepancies that arise. 

3. Express sin 1 x in the form of a power series 'y'a n x" by solving y' = 

(1 - x 2 )~ 1/2 in two ways. (Hint: Remember the binomial series.) Use this 
result to obtain the formula 


7i_l 11 1-3 1 1-3-5 1 

6 2 2 3-2 3 2-4 5• 2 5 2-4-6 7 -2 7 

4. The differential equations considered in the text and preceding prob¬ 
lems are all linear. The equation 

y'=i +y 2 0 

is nonlinear, and it is easy to see directly that y = tan x is the particular 
solution for which y( 0) = 0. Show that 

1 3 2 5 

tanx = x +—x h- x +••• 

3 15 

by assuming a solution for equation (*) in the form of a power series 
and finding the a„ in two ways: 

(a) by the method of the examples in the text (note particularly how the 
nonlinearity of the equation complicates the formulas); 

(b) by differentiating equation (*) repeatedly to obtain 

y" = 2yy',y"' = 2yy" + 2(y') 2 ,..., 

and using the formula a n -f n \0)/n\. 

5. Solve the equation 


y'=x-y, y(0)=0 

by each of the methods suggested in Problem 4. What familiar function 
does the resulting series represent? Verify your conclusion by solving 
the equation directly as a first order linear equation. 
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28 Second Order Linear Equations. Ordinary Points 

We now turn our attention to the general homogeneous second order linear 
equation 


y" + P(x)y' + Q(x)y = 0. (1) 

As we know, it is occasionally possible to solve such an equation in terms of 
familiar elementary functions. This is true, for instance, when P(x) and Q(x) 
are constants, and in a few other cases as well. For the most part, however, 
the equations of this type having the greatest significance in both pure and 
applied mathematics are beyond the reach of elementary methods, and can 
only be solved by means of power series. 

The central fact about equation (1) is that the behavior of its solutions near 
a point x 0 depends on the behavior of its coefficient functions P(x) and Q(x) 
near this point. In this section we confine ourselves to the case in which P(x) 
and Q(x) are "well behaved" in the sense of being analytic at x 0 , which means 
that each has a power series expansion valid in some neighborhood of this 
point. In this case x 0 is called an ordinary point of equation (1), and it turns 
out that every solution of the equation is also analytic at this point. In other 
words, the analyticity of the coefficients of (1) at a certain point implies that 
its solutions are also analytic there. Any point that is not an ordinary point 
of (1) is called a singular point. 

We shall prove the statement made in the above paragraph, but first we 
consider some illustrative examples. 

In the case of the familiar equation 


y"+y=0, (2) 

the coefficient functions are P(x) = 0 and Q(x) = 1, These functions are analytic 
at all points, so we seek a solution of the form 

y=a Q + a l x + a 2 x 1 + ■■■ +a n x n + (3) 

Differentiating (3) yields 

y' = flj + 2 a 2 x + 3 a 3 x 2 +••• + (« + l)a„ +1 x" + • • • (4) 


and 


y"-2a 2 + 2 ■ 3a 3 x + 3 ■ 4a 4 x 2 + ■■• + (n + l)(n + 2)a n+2 x n + ■ ■ ■. 


(5) 
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If we substitute (5) and (3) into (2) and add the two series term by term, 
we get 


(la 2 + a 0 ) + (2 • 3 a 3 + a 2 )x + (3 ■ 4 a i + a 2 )x 2 + (4 ■ 5a s + a 3 )x 3 
+ ■ ■ • + K« +1)(« + 2)«„ +2 + a„]x" + ■ ■ • - 0; 

and equating to zero the coefficients of successive powers of x gives 

2a 2 + a 0 = 0, 2-3fl 3 + fl 1 = 0, 3-4a 4 + fl 2 = 0, 

4 • 5a 5 + a 3 = 0,..., (n + l)(n+2)a„ +2 + a„-0,... . 

By means of these equations we can express a n in terms of a 0 or a v according 
as n is even or odd: 


a 2 = - 


a o 

2 ' 


a 3 =■ 


Jh_ 

2-3' 


#4 — ■ 


CI2 

' 3^4 


a 0 

2-3-4 7 


_ a i 

4d5 “ 2-3-4-5'"" 


With these coefficients, (3) becomes 


y = a 0 +a 1 x- — x 1 l —x 3 + ———x 4 h - —- x 5 ■ 

J 2 2-3 2-3-4 2-3-4-5 


— «o 


f 2 4 

. X X 

1—+- 

v 2! 4! 


\ / 3 5 \ 

x J x 2 

X -+- 


+ U\ 


3! 5! 


( 6 ) 


Let t/|(x) and y 2 (x) denote the two series in parentheses. We have shown for¬ 
mally that (6) satisfies (2) for any two constants a 0 and a v In particular, by 
choosing a 0 = 1 and q^Owe see that y, satisfies this equation, and the choice 
fl 0 =0 and «j = l shows that y 2 also satisfies the equation. Just as in the examples 
of the previous section, the only remaining issue concerns the convergence of 
the two series defining y 1 and y 2 . But the ratio test shows at once that each 
of these series—and therefore the series (6)—converges for all x (see Problem 
26-3). It follows that all the operations performed on (3) are legitimate, so (6) 
is a valid solution of (2) as opposed to a merely formal solution. Furthermore, 
y 1 and y, are linearly independent since it is obvious that neither series is a 
constant multiple of the other. We therefore see that (6) is the general solution 










212 


Differential Equations with Applications and Historical Notes 


of (2), and that any particular solution is obtained by specifying the values 
of y(0) -a 0 and t/'(0) = a v 

In the above example the two series in parentheses are easily recognizable 
as the expansions of cos x and sin x, so (6) can be written in the form 

y=a 0 cos x+flj sin x. 

Naturally, this conclusion could have been foreseen in the beginning, since 
(2) is a very simple equation whose solutions are perfectly familiar to us. 
However, this result should be regarded as only a lucky accident, for most 
series solutions found in this way are quite impossible to identify and repre¬ 
sent previously unknown functions. 

As an illustration of this remark, we use the same procedure to solve 
Legendre's equation 


(1 - x 2 )y" -2xy' + p(p + l)y = 0, 

where p is a constant. It is clear that the coefficient functions 


P(x) = 


-2x 

1-x 2 


and 


Q(x) = 


p(p + i) 
l-x 2 


(7) 


( 8 ) 


are analytic at the origin. The origin is therefore an ordinary point, and 
we expect a solution of the form y = ^^fl„x". Since y' = + lK+ix”, we 

get the following expansions for the individual terms on the left side of 
equation (7): 


y" = X/” ++2 ) fl ” +2X " 

-x 2 y" = — (n — l)nfl„x", 

-2xy' = ^ - 2 na„x” 


and 

P(P + % =^p(p + l)a n x n . 


By equation (7), the sum of these series is required to be zero, so the coef¬ 
ficient of x" must be zero for every n: 

(n + l)(n + l)a n+2 - (n -1 )na„ - 2 na„ + p(p +1 )a n - 0. 
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With a little manipulation, this becomes 


(p-n)(p + n + 1)„ 

U n+2 ~ -;- 777 -—- a n 

(n + l)(n + 2) 


(9) 


Just as in the previous example, this recursion formula enables us to express a n 
in terms of a 0 or a ] according as n is even or odd: 

a - P^ + 1) a 
a 2 - — fl 0/ 

_ (p-l)(p + 2) 

« 3 - — a lr 

(P — 2)(p + 3) ^ _ p(p-2)(p + l)(p + 3) ^ 

4 ~ 3^4 2 ~~ 4 ! a °' 

(P - 3)(p + 4) ^ (p - l)(p - 3)(p + 2)(p + 4) ^ 

" 5 “ 4-5 fl3 " 5 ! 8l ' 


_ (p — 4)(p + 5) 

a 6 - - a 4 

5-6 

P(p-2)(p-4)(p + l)(p + 3)(p + 5) 
6 ! 


#0' 


- . (P“5)(P + 6)„ 

fl7 “ 6 ^ " 5 

(p-l)(P~3)(p-5)(p + 2)(p + 4)(p + 6) 
7! 


fl i/ 


and so on. By inserting these coefficients into the assumed solution 
y = ^Ta„x n , we obtain 


y = a 0 1 _ P(P + 1) x 2 + P(P-2)(P + 1)(P + 3) ^ 
7 2! 4! 

P(P ~ 2 )(P ~ 4)(p + l)(p + 3)(p + 5) | 

6 ! 


+ 


x _ (p~l)(p + 2) ^3 + (p-l)(p-3)(p + 2)(p + 4) ^5 


3! 


5! 


(p - l)(p - 3)(p -5)(p + 2)(p + 4)(p + 6) ^ | 
7! 


( 10 ) 


as our formal solution of (7). 
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When p is not an integer, each series in brackets has radius of convergence 
R= 1. This is most easily seen by using the recursion formula (9): for the first 
series, this formula (with n replaced by 2 n) yields 


n r 2n+2 

“In+lX 


(p-2n)(p + 2n + l) 

a r 2n 

U2nX 


(2n + l)(2n + 2) 


i2 i2 

pc —> pc 


as n —► °°, and similarly for the second series. As before, the fact that each 
series has positive radius of convergence justifies the operations we have per¬ 
formed and shows that (10) is a valid solution of (7) for every choice of the 
constants a 0 and a v Each bracketed series is a particular solution; and since 
it is clear that the functions defined by these series are linearly independent, 
(10) is the general solution of (7) on the interval |x|<l. 

The functions defined by (10) are called Legendre functions, and in general 
they are not elementary. However, when p is a nonnegative integer, one of 
the series terminates and is thus a polynomial—the first series if p is even 
and the second series if p is odd—while the other does not and remains 
an infinite series. This observation leads to the particular solutions of (7) 
known as Legendre polynomials, whose properties and applications we dis¬ 
cuss in Chapter 8. 

We now apply the method of these examples to establish the following 
general theorem about the nature of solutions near ordinary points. 


Theorem A. Let x 0 be an ordinary point of the differential equation 

y" + P(x)y' + Q(x)y = 0, (11) 

and let a 0 and a 1 be arbitrary constants. Then there exists a unique function y(x) 
that is analytic at x 0 , is a solution of equation (11) in a certain neighborhood of this 
point, and satisfies the initial conditions y(x 0 ) = a 0 and y'(xf)-a v Furthermore, if 
the power series expansions ofP(x) and Q(x) are valid on an interval \ x - x 0 1 < R, 
R> 0, then the power series expansion of this solution is also valid on the same 
interval. 

Proof. For the sake of convenience, we restrict our argument to the case in 
which x 0 = 0. This permits us to work with power series in x rather than x - x 0 , 
and involves no real loss of generality. With this slight simplification, the 
hypothesis of the theorem is that P(x) and Q(x) are analytic at the origin and 
therefore have power series expansions 


P(x) = ^ p n x 11 =p 0 + pyx + p 2 x 2 + ■■■ 

n =0 


( 12 ) 
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and 


QO) = Y (?„*" =q 0 + cpx + q 2 x 2 +■■■ (13) 

n= 0 


that converge on an interval |x|<_R for some R> 0. Keeping in mind the 
specified initial conditions, we try to find a solution for (11) in the form of 
a power series 


y = 

n =0 


= a 0 + apx + a 2 x 2 + ■■■ 


(14) 


with radius of convergence at least R. Differentiation of (14) yields 


y' = ’^T{n + l)a n+1 x n = + 2 a 2 x + 3 a 3 x 2 + ■■■ (15) 

n =0 


and 


y" = Y^ n + l)(n + 2)a n+1 x n 

n =0 


— la 2 + 2 ■ 3a 3 x + 3 • 4^4X + * • *. 


(16) 


It now follows from the rule for multiplying power series that 


P(x)y' = ^Jn + l)a n+1 x n 

V n= 0 J _ n= 0 

co n 

= J' j Y p„- k (k + \)ak. 


n =0 k =0 


(17) 


and 


r 


Q(x)y = 


\ / 


\ 


YjlnX" Yf 


a n x 

V n= 0 J V n= 0 7 

f n \ 

x n . 


Yj Y $»-k a k 

n =0 V k=Q 


7 


( 18 ) 
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On substituting (16), (17), and (18) into (11) and adding the series term by 
term, we obtain 


^ (n + l)(n+2)a n+2 + 'Sj? n _ k {k + l)a k+1 + 

n=0 k =0 k =0 


k^k 


x" = 0, 


so we have the following recursion formula for the a n : 

n 

(n +1 ){n + 2)a n+1 = -~^J(k + l)p n - k a k+ i + q n -kHk\- (19) 

k =0 

For n = 0,1,2, ... this formula becomes 

2a 2 — — (Po a i + %a 0 ), 

2 ■ 3a 3 — —(p 1 a l + 2p 0 a 2 + q 1 a 0 + q 0 af, 

3 ■ 4 a^ — (fh^i 3p 0 a 3 “h q 2 a^ ^ 0 ^ 2 )/ 


These formulas determine a 2 , a 3 , ... in terms of a 0 and a v so the resulting 
series (14), which formally satisfies (11) and the given initial conditions, is 
uniquely determined by these requirements. 

Suppose now that we can prove that the series (14), with its coefficients 
defined by formula (19), actually converges for |x|<iT Then by the general 
theory of power series it will follow that the formal operations by which 
(14) was made to satisfy (11)—termwise differentiation, multiplication, and 
term-by-term addition—are justified, and the proof will be complete. This 
argument is not easy. We give the details in Appendix A, where they can be 
omitted conveniently by any reader who wishes to do so. 

A few final remarks are in order. In our examples we encountered only 
what are known as two-term recursion formidas for the coefficients of the 
unknown series solutions. The simplicity of these formulas makes it fairly 
easy to determine the general terms of the resulting series and to obtain 
precise information about their radii of convergence. However, it is appar¬ 
ent from formula (19) that this simplicity is not to be expected in general. 
In most cases the best we can do is to find the radii of convergence of the 
series expansions of P(x) and Q(x) and to conclude from the theorem that 
the radius for the series solution must be at least as large as the smaller of 
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these numbers. Thus, for Legendre's equation it is clear from (8) and the 
familiar expansion 


1 -i 2 4 n -i 

-=- = l+x + x + •••, R- 1, 

1-x * 1 2 3 4 

that R = 1 for both P(x) and Q(x). We therefore know at once, without further 
calculation, that any solution of the form y = ' S * S^a n x" must be valid at least 
on the interval I x I < 1. 


Problems 

1. Find the general solution of (l+x 2 )y" + 2xy'-2y = 0 in terms of power 
series in x. Can you express this solution by means of elementary 
functions? 

2. Consider the equation y" + xy’ + y = 0. 

(a) Find its general solution y = ^~\;„x" in the form y-a 0 y r {x) +ay]\(x), 
where y,(x) and y 2 (x) are power series. 

(b) Use the ratio test to verify that the two series y,(x) and y 2 (x) converge 
for all x, as Theorem A asserts. 

(c) Show that y { (x) is the series expansion of e , use this fact to find a 

second independent solution by the method of Section 16, and con¬ 
vince yourself that this second solution is the function y 2 (x) found 
in (a). 

3. Verify that the equation y" + y' - xy = 0 has a three-term recursion for¬ 
mula, and find its series solutions y,(x) and y 2 (x) such that 

(a) yi(0) = l, yl(0) = 0; 

(b) y 2 (0) = 0, yi(0) = l. 

Theorem A guarantees that both series converge for all x. Notice 
how difficult this would be to prove by working with the series 
themselves. 

4. The equation y" + (p +1 - \x 7 )y = 0, where p is a constant, certainly has a 

series solution of the form y = '^'a n x n . 

(a) Show that the coefficients a„ are related by the three-term recursion 

formula 


(n +1 )(n + 2 )a n+2 + 



a n ——fl „-2 — 0. 
4 
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(b) If the dependent variable is changed from y to w by means 
of y = we 1 /4 , show that the equation is transformed into 
w" - xw' +pw = 0. 

(c) Verify that the equation in (b) has a two-term recursion formula 
and find its general solution. 

5. Solutions of Airy’s equation y" +xy = 0 are called Airy functions, and 

have applications to the theory of diffraction. 4 

(a) Apply the theorems of Section 24 to verify that every nontrivial 
Airy function has infinitely many positive zeros and at most one 
negative zero. 

(b) Find the Airy functions in the form of power series, and verify 
directly that these series converge for all x. 

(c) Use the results of (b) to write down the general solution of y" - xy = 0 
without calculation. 

6. Chebyshev’s equation is 


(l-x 2 )xy"-xiy' + p 2 y=0 / 


where p is a constant. 

(a) Find two linearly independent series solutions valid for | x | < 1. 

(b) Show that if p = n where n is an integer > 0, then there is a polyno¬ 
mial solution of degree n. When these are multiplied by suitable 
constants, they are called the Chebyshev polynomials. We shall return 
to this topic in the problems of Section 31 and in Appendix D. 

7. Hermite's equation is 


y"-2xy’ + 2py = 0, 


where p is a constant. 

(a) Show that its general solution is y(x) =a 0 yfx) + a- ] y 2 (x), where 

y, M =1 ■-,y» + ^ :2 > - 2 Ai!Lme - +... 

J 2! 4! 6! 


4 Sir George Biddell Airy (1801-1892), Astronomer Royal of England for many years, was a 
hard-working, systematic plodder whose sense of decorum almost deprived John Couch 
Adams of credit for discovering the planet Neptune. As a boy Airy was notorious for his 
skill in designing peashooters; but in spite of this promising start and some early work in the 
theory of light—in connection with which he was the first to draw attention to the defect of 
vision known as astigmatism—he developed into the excessively practical type of scientist 
who is obsessed by elaborate numerical computations and has little use for general scientific 
ideas. 
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and 


y<x) = x -*PlA x s + 2 2 (p-l)(p-3) x5 
J 3! 5! 


2 3 (p-l)(p-3)(p-5) x? < 
7! 


By Theorem A, both series converge for all x. Verify this directly. 


(b) 


If p is a nonnegative integer, then one of these series terminates 
and is thus a polynomial— y , (x) if p is even, and y 2 (x) if p is odd— 
while the other remains an infinite series. Verify that for p = 0, 1, 


2 4 

2, 3, 4, 5, these polynomials are 1, x, 1 - 2x 2 , x —x 3 , 1 - 4x 2 + —x 4 , 
4,4= 3 3 


x — X' 

3 


H-X" 

15 


(c) It is clear that the only polynomial solutions of Hermite's equa¬ 
tion are constant multiples of the polynomials described in 
(b). Those constant multiples with the property that the terms con¬ 
taining the highest powers of x are of the form 2”x" are denoted 
by H n (x) and called the Hermite polynomials. Verify that H 0 (x) = l, 
Hj(x) = 2x, H 2 (x) = 4x 2 - 2, H 3 (x) = 8x 3 - 12x, H 4 (x) - 16x 4 - 48x 2 + 12, and 
H 5 (x) = 32x 5 - 160x 3 + 120x. 


(d) Verify that the polynomials listed in (c) are given by the general 
formula 


H„(x) = (-l)V 2 


d" 

- e 

dx n 


In Appendix B we show how the formula in (d) can be deduced 
from the series in (a), we prove several of the most useful prop¬ 
erties of the Hermite polynomials, and we show briefly how 
these polynominals arise in a fundamental problem of quantum 
mechanics. 


29 Regular Singular Points 

We recall that a point x 0 is a s ingular point of the differential equation 


y" + P(x)y' + Q(x)y = 0 


( 1 ) 
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if one or the other (or both) of the coefficient functions P(x ) and Q(x) fails to be 
analytic at x 0 . In this case the theorem and methods of the previous section 
do not apply, and new ideas are necessary if we wish to study the solutions 
of (1) near x 0 . This is a matter of considerable practical importance; for many 
differential equations that arise in physical problems have singular points, 
and the choice of physically appropriate solutions is often determined by 
their behavior near these points. Thus, while we might want to avoid the 
singular points of a differential equation, it is precisely these points that usu¬ 
ally demand particular attention. As a simple example, the origin is clearly 
a singular point of 


y" + 2 y'-^y = 0. 


It is easy to verify that y, =x and y 2 -x~ 2 are independent solutions for 
x > 0, so y = cpx + c 2 x~ 2 is the general solution on this interval. If we happen 
to be interested only in solutions that are bounded near the origin, then 
it is evident from this general solution that these are obtained by putting 
c 2 = 0. 

In general, there is very little that can be said about the solutions of (1) 
near the singular point x 0 . Fortunately, however, in most of the applications 
the singular points are rather "weak," in the sense that the coefficient func¬ 
tions are only mildly nonanalytic, and simple modifications of our previ¬ 
ous methods yield satisfactory solutions. These are the regular singular 
points, which are defined as follows. A singular point x 0 of equation (1) is 
said to be regular if the functions (x - x 0 ) P(x) and (x-x 0 ) 2 Q(x) are analytic, 
and irregidar otherwise. 5 Roughly speaking, this means that the singular¬ 
ity in P(x) cannot be worse than l/(x-x 0 ), and that in Q(x) cannot be worse 
than l/(x-x 0 ) 2 . 

If we consider Legendre's equation 28-(7) in the form 


y ~ 


2x 


1 — x 




it is clear that x = l and x = -l are singular points. The first is regular 
because 


(x-l)P(x) = 


2x 
x + 1 


and 


(x-1) 2 Q(x) = 


(x~l)p(p + l) 

x + 1 


5 This terminology follows a time-honored tradition in mathematics, according to which situa¬ 
tions that elude simple analysis are dismissed by such pejorative terms as "improper," "inad¬ 
missible," "degenerate," "irregular," and so on. 
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are analytic at x = 1, and the second is also regular for similar reasons. As 
another example, we mention Bessel's equation of order p, where p is a non¬ 
negative constant: 


x 2 y" + xy' + (x 2 - p 2 )y = 0. 

If this is written in the form 


( 2 ) 


1 x 2 — rf 

y" + -y' + - f—y = 0 , 

x x 

it is apparent that the origin is a regular singular point because 
xP(x) -1 and x 2 Q(x) -x 2 -p 2 

are analytic at x = 0. In the remainder of this chapter we will often use Bessel's 
equation as an illustrative example, and in Chapter 8 its solutions and their 
applications will be examined in considerable detail. 

Now let us try to understand the reasons behind the definition of a regu¬ 
lar singular point. To simplify matters, we may assume that the singular 
point x 0 is located at the origin; for if it is not, then we can always move it to 
the origin by changing the independent variable from x to x-x 0 . Our start¬ 
ing point is the fact that the general form of a function analytic at x = 0 is 
a 0 +a 1 x+a 1 x 2 + -.Asa consequence, the origin will certainly be a singular 
point of (1) if 


-P(x) — • • • h— 2~ "t- r b o + b\X + b 2 x + • • • 


r\/ \ C_2 C -1 2 

(*2(x) — 1 y H-h Co + C\X + C 2 X + • * v 

X X 

and at least one of the coefficients with negative subscripts is nonzero. The 
type of solution we are aiming at for (1), for reasons that will appear below, 
is a "quasi power series" of the form 

y = x m (a 0 + AjX + a 2 x 2 + ■ • •) 

= agX m + « 1 x m+1 + a 2 x m+2 + ■■■, (3) 

where the exponent m may be a negative integer, a fraction, or even an irra¬ 
tional real number. We will see in Problems 6 and 7 that two independent 
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solutions of this kind are possible only if the above expressions for P(x) and 
Q(x) do not contain, respectively, more than the first term or more than the 
first two terms to the left of the constant terms b 0 and c 0 . An equivalent state¬ 
ment is that xP(x) and x 2 Q(x) must be analytic at the origin; and according 
to the definition, this is precisely what is meant by saying that the singular 
point x = 0 is regular. 

The next question we attempt to answer is: where do we get the idea that 
series of the form (3) might be suitable solutions for equation (1) near the 
regular singular point x - 0? At this stage, the only second order linear equa¬ 
tion we can solve completely near a singular point is the Euler equation dis¬ 
cussed in Problem 17-5: 


x 2 y" +pxy' +qy = 0. 


(4) 


If this is written in the form 


\f + ?-y' + \y = §, (5) 

x x 

so that P(x) = p/x and Q(x) = q/x 2 , then it is clear that the origin is a regular 
singular point whenever the constants p and q are not both zero. The solu¬ 
tions of this equation provide a very suggestive bridge to the general case, so 
we briefly recall the details. The key to finding these solutions is the fact that 
changing the independent variable from xtoz = log x transforms (4) into an 
equation with constant coefficients. To carry out this process, we assume that 
x > 0 (so that z is a real variable) and write 

, _ dy _ dy dz _ dy 1 
^ dx dz dx dz x 


and 


,, = dh/ = _d_r 

dx 2 dx v dx J dz y x 2 J x dx\dz ) 

1 dy 1 d f dy dz _ 1 d 2 y 1 dy 
x 2 dz x dz\dz ) dx x 2 dz 2 x 2 dz 

When these expressions are inserted in (4), the transformed equation is 
clearly 


d 2 y 

dz 2 


+ (P-1) 


dy 

dz 


+ qy = 0, 


( 6 ) 
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whose auxiliary equation is 


m 2 + (p- 1 )m + q = 0. (7) 

If the roots of (7) are m 1 and m 2 , then we know that (6) has the following 
independent solutions: 

e" nz and e miz ifm 2 ^m 1 ; 
e mz and ze mz if m 2 = m 1 . 


Since e z -x, the corresponding pairs of solutions for (4) are 

x'" 1 and x mi if m 2 * m : ; 

x m and x’ m logx if m 2 = . 


If we seek solutions valid on the interval x < 0, we have only to change the 
variable to t = -x and solve the resulting equation for t> 0. 

We have presented this discussion of Euler's equation and its solutions for 
two reasons. First, we point out that the most general differential equation 
with a regular singular point at the origin is simply equation (5) with the 
constant numerators p and q replaced by power series: 


y + 


r po+pix+pix 1 +--- A 


y + 


f 2 \ 

q 0 + q 1 x + q 2 x +•■ 

\ x J 


y = 0 - 


(9) 


Second, if the transition from (5) to (9) is accomplished by replacing con¬ 
stants by power series, then it is natural to guess that the corresponding 
transition from (8) to the solutions of (9) might be accomplished by replacing 
power functions x m by series of the form (3). We therefore expect that (9) will 
have two independent solutions of the form (3), or perhaps one of this form 
and one of the form 


y = x m log x (a 0 +a l x+a 1 x 2 + ■••), (10) 

where we assume that x > 0. The next section will show that these are very 
good guesses. 

One final remark is necessary before we leave these generalities. Notice 
that if a 0 = 0 in expressions like (3) and (10), then some positive integral power 
of x can be factored out of the power series part and combined with x m . We 
therefore always assume that a 0 =£0 in such expressions; and this assump¬ 
tion means only that the highest possible power of x is understood to be 
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factored out before any calculations are performed. Series of the form (3) 
are called Frobenius series, and the procedure described below for finding 
solutions of this type is known as the method of Frobenius. 6 Frobenius series 
evidently include power series as special cases, whenever m is zero or a posi¬ 
tive integer. 

To illustrate the above ideas, we consider the equation 

1x 2 y" + x(2x + l)y'-y = 0. (11) 

If this is written in the more revealing form 


„ 1/2 + x , —1 / 2 n 

y +- y +—= o, 

X X 


( 12 ) 


then we see at once that xP(x) - — + x and x 2 Q(x) = so x = 0 is a regular sin¬ 
gular point. We now introduce our assumed Frobenius series solution 

y = x m (a 0 + a x x + a 2 x 2 + ■ ■ •) 

= floX’" + n 1 x m+1 + a 2 x m+2 +■■■, (13) 


and its derivatives 

y’ = a 0 mx m_1 + afm + l)x m + a 2 (m + 2)x m+1 + ■ ■ ■ 


and 


y"= a 0 m(m - l)x m ~ 2 + afm + l)mx'" _1 
+ a 2 (m + 2 )(m + l)x" ! + • ■ - . 

To find the coefficients in (13), we proceed in essentially the same way as in 
the case of an ordinary point, with the significant difference that now we 
must also find the appropriate value (or values) of the exponent m. When the 
three series above are inserted in (12) and the common factor x m ~ 2 is canceled, 
the result is 


6 Ferdinand Georg Frobenius (1849-1917) taught in Berlin and Zurich. Fie made several valu¬ 
able contributions to the theory of elliptic functions and differential equations. Flowever, his 
most influential work was in the field of algebra, where he invented and applied the impor¬ 
tant concept of group characters and proved a famous theorem about possible extensions of 
the complex number system. 
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a 0 m(m -1) + fli(m +1 )mx + a 2 (m +1 )(m + l)x * 2 + • • • 

++ x\a 0 m + ai(m + l)x + a 2 (m + 2)x 2 + • • •] 

1 . 

- — (fl 0 + a 2 x + a 2 x +•••) = 0. 

By inspection, we combine corresponding power of x and equate the 
coefficient of each power of x to zero. This yields the following system of 
equations: 


a o 


1 1 

m(m - 1 ) + — m - — 


«i 


1 1 
(m +1 )m + — (m + 1 ) - — 


a 2 


1 1 
(m + 2)(m + 1 ) + — (m + 2 ) - — 


= 0 , 

+ a 0 m = 0 , 

+ a 1 (m + 1 ) = 0 , 


(14) 


As we explained above, it is understood that a 0 / 0. It therefore follows from 
the first of these equations that 

1 1 

m(m-V) + — m- — = 0. (15) 

This is called the indicial equation of the differential equation (11). Its 
roots are 


ni\ = 1 and 


m 2 = 


i 

2 ' 


and these are only possible values for the exponent m in (13). For each of 
these values of m, we now use the remaining equations of (14) to calculate a v 
a 2 , ... in terms of a 0 . For m 1 = 1, we obtain 


«i =■ 


a 2 =- 




2-1 + —-2 — 
2 2 


2u ] 


3-2 +A. 3 . 

2 


2 

~ — a Qr 

5 


2 4 

—— — a 0 . 
7 35 
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And for m 2 =- , we obtain 


«i = 


«2 =- 


1 

-flo 


-- + 


— _ ^ 0 / 


1 

I" 1 


3 1 1 3 

2’2 + 2 ’ 2 ' 




We therefore have the following two Frobenius series solutions, in each of 
which we have put a 0 = 1 : 


i i 2 4 2 

t/i = x 1 — x + —X +• 

J 1 5 35 


(16) 


y 2 = x 


- 1/2 



(17) 


These solutions are clearly independent for x > 0, so the general solution of 
( 11 ) on this interval is 

r 2 4 2 1/2 f, i 2 'i 

t/ = CiX 1 — x + —x +••• +c 2 x 1 — x H—x + ••• . 

I 5 35 J l 2 J 

The problem of determining the interval of convergence for the two power 
series in parentheses will be discussed in the next section. 

If we look closely at the way in which (15) arises from (12), it is easy to see 
that the indicial equation of the more general differential equation (9) is 

m(m - 1 ) + mpo + q 0 - 0 . (18) 

In our example, the indicial equation had two distinct real roots leading to 
the two independent series solutions (16) and (17). It is natural to expect such 
a result whenever the indicial equation (18) has distinct real roots m 1 and 
m 2 . This turns out to be true if the difference between m 1 and m 2 is not an 
integer. If, however, this difference is an integer, then it often (but not always) 
happens that one of the two expected series solutions does not exist. In this 
case it is necessary—just as in the case wq = m 2 —to find a second independent 
solution by other methods. In the next section we investigate these difficul¬ 
ties in greater detail. 
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Problems 


1. For each of the following differential equations, locate and classify its 
singular points on the x-axis: 

(a) x 3 (x - l)y" - 2(x - l)y' + 3xy = 0; 

(b) x 2 (x 2 -l) 2 y"-x(l-x)y' + 2y = 0; 

(c) x 2 y" + (2 - x)y' = 0; 

(d) (3x + l)xy" - (x + l)y' + 2y = 0. 

2. Determine the nature of the point x = 0 for each of the following 
equations: 

(a) y" + (sin x)y = 0; 

(b) xy" + (sinx)y = 0; 

(c) x 2 y" + (sin x)y = 0; 

(d) x 3 y" + (sin x)y = 0; 

(e) x 4 y" + (sin x)y = 0. 

3. Find the indicial equation and its roots for each of the following dif¬ 
ferential equations: 

(a) x 3 y" + (cos 2x - l)y' + 2xy = 0; 

(b) 4x 2 y" + (2x 4 - 5x)y' + (3x 2 + 2)y = 0. 

4 . For each of the following equations, verify that the origin is a regu¬ 
lar singular point and calculate two independent Frobenius series 
solutions: 

(a) 4xy" + 2y' + y = 0; 

(b) 2xy" + (3-x)y'-y = 0; 

(c) 2xy" + (x + l)y' + 3y = 0; 

(d) 2x 2 y" + xy' - (x + l)y = 0. 

5. When p = 0, Bessel's equation (2) becomes 


x 2 y"+xy'+x 2 y = 0. 


Show that its indicial equation has only one root, and use the method 
of this section to deduce that 



is the corresponding Frobenius series solution [see Problem 26-7(b)]. 
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6. Consider the differential equation 


y - o. 


(a) Show that x - 0 is an irregular singular point. 

(b) Use the fact that y 1 =x is a solution to find a second independent 
solution y 2 by the method of Section 16. 

(c) Show that the second solution y 2 found in (b) cannot be expressed 
as a Frobenius series. 

7. Consider the differential equation 


y"+^y'+p = o, 


where p and q are nonzero real numbers and b and c are posi¬ 
tive integers. It is clear that x = 0 is an irregular singular point if 
b> 1 or c>2. 

(a) If b = 1 and c = 3, show that there is only one possible value of m for 
which there might exist a Frobenius series solution. 

(b) Show similarly that m satisfies a quadratic equation—and hence we 
can hope for two Frobenius series solutions, corresponding to the 
roots of this equation—if and only if b= 1 and c < 2. Observe that 
these are exactly the conditions that characterize i = 0asa "weak" 
or regular singular point as opposed to a "strong" or irregular sin¬ 
gular point. 

8. The differential equation 

x 2 y" + (3x- l)i/' + y=0 

has i = 0asan irregular singular point. If (3) is inserted into this equa¬ 
tion, show that m = 0 and the corresponding Frobenius series "solution" 
is the power series 


n =0 


which converges only at x-0. This demonstrates that even when a 
Frobenius series formally satisfies such an equation, it is not necessar¬ 
ily a valid solution. 
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30 Regular Singular Points (Continued) 

Our work in the previous section was mainly directed at motivation and 
technique. We now confront the theoretical side of the problem of solving the 
general second order linear equation 

y" + P(x)y' + Q{x)y = 0 (1) 

near the regular singular point x = 0. The ideas developed above suggest 
that we attempt a formal calculation of any solutions of (1) that have the 
Frobenius form 


y = x m (a 0 + apx + a 2 x 2 +■ ■ •), (2) 

where a 0 / 0 and m is a number to be determined. Our hope is that any for¬ 
mal solution that arises in this way can be legitimized by a proof and estab¬ 
lished as a valid solution. The generality of this approach will also serve to 
illuminate the circumstances under which equation (1) has only one solu¬ 
tion of the form (2). 7 For reasons already explained, we confine our attention 
to the interval x > 0. The behavior of solutions on the interval x < 0 can be 
studied by changing the variable to t - -x and solving the resulting equation 
for t > 0. 

Our hypothesis is that xP(x) and x 2 Q(x) are analytic at x = 0, and therefore 
have power series expansions 


xP(x) = ^Tp n x n and x 2 Q(x) = q„x n (3) 

n =0 n=0 


which are valid on an interval |x |<R for some R> 0. Just as in the example of 
the previous section, we must find the possible values of m in (2); and then, 
for each acceptable m, we must calculate the corresponding coefficients a 0 , a u 
a 2 , ... . If we write (2) in the form 


y = x ra Ya„x" = Yfl„x m+ ”, 

n =0 n =0 


7 When we say that (1) has "only one" solution of the form (2), we mean that a second indepen¬ 
dent solution of this form does not exist. 
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then differentiation yields 


y' = J'a n (m + n)x” 


n =o 


and 


y" = ^ ji n (m + n)(m + n-l)x m+n 2 

n=0 

CO 

= x m ~ 2 'y~'a n (m + n)(m + n — l)x n . 

n=0 

The terms P(x)y' and Q(x)y in (1) can now be written as 

' OO A CO 

V a n {rn + n)x m+n - 1 
: 0 J n= 0 

00 A 00 

^jp n x n y a n (m + n)x n 


and 


P(x)y' = - 

X 


= X 


= X 


= x" 


V n= 0 J L n=0 


oo n 


2 ^ y]p n - k a k (m + k) 

n=0 _k =0 
00 M 

y (m + k) + p 0 a n (m + n) 


n =0 fc=0 


/ 


Q(*)y = 


\ / 


\ 




= x 


a n x 

V «=o y v w=o j 

00 V 0° A 

^J] n X n ^Jl n X 

v»=o A«=o y 


= x m ^ ~j„- k a k 

n =0 V ^=0 7 

oo 7 n-1 

= x- 2 y ^Jjn-kClk + qoa n 

n=0 v ^=0 


\ 


X". 
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When these expressions for y", P(x)y' and Q(x)y are inserted in (1) and the 
common factor x m ~ 2 is canceled, then the differential equation becomes 


yj a„[(m + n)(m + n- 1) + (m + n)p 0 + q 0 ] 

n =0 [ 
n -1 

+^y k [(m + k)p n _ k + q n _ k ] 

k =0 



and equating to zero the coefficient of x" yields the following recursion for¬ 
mula for the a n : 


a n [(m + n)(m + n- 1) + (m + n)p 0 + tj 0 ] 

n —1 

+y\i k [(m + k)p„- k + q n - k ] = 0. (4) 

k=0 

On writing this out for the successive values of n, we get 


a 0 [m(m- 1) + mp 0 + q 0 ] = 0, 
a Aim +1 )m +(m +1 )p 0 + q 0 ] + a 0 (mpi + </i) = 0, 
a 2 [(ni + 2)(m + T) + (m + 2)po + qo\+ao{mp 2 + q 2 ) + ai[(m + l)pi +^i] = 0, 


a n [(m+n)(m+n-l)+(m+n)po + q 0 ]+a 0 (mp„ + q„)+---+a„- 1 [(m+n-l)p 1 + q 1 ] = 0. 


If we put/(wi) = m(m - 1) + mp 0 + q 0 , then these equations become 

a Q f(m)=0, 

ajim + 1 ) + a 0 (mp 1 + qj = 0 , 
a/(m + 2 ) + a 0 (mp 2 + q 2 ) +«i[(wt +l)pi + qA = 0 , 

a„f(m + n) + a 0 (mp n + q n ) + ■ ■ ■ + a n _A(m + n-V)p 1 + qA = 0 , 

Since « 0 /0, we conclude from the first of these equations that f(m)=0 or, 
equivalently, that 


m(m -1) + mp 0 + q 0 - 0. 


(5) 
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This is the indicial equation, and its roots m, and m 2 —which are possible values 
for m in our assumed solution (2)—are called the exponents of the differential 
equation (1) at the regular singular point x = 0. The following equations give 
in terms of a 0 , a 2 in terms of a 0 and a t and so on. The a„ are therefore deter¬ 
mined in terms of a 0 for each choice of m unless f(m + n) = 0 for some positive 
integer n, in which case the process breaks off. Thus, if m 1 = m 2 + n for some 
integer n > 1, the choice m = m 1 gives a formal solution but in general m = m 2 
does not—since f(m 2 + n)=f(mf = 0. If m 1 = m 2 we also obtain only one formal 
solution. In all other cases where rn t and m 2 are real numbers, this procedure 
yields two independent formal solutions. It is possible, of course, for m t and 
m 2 to be conjugate complex numbers, but we do not discuss this case because 
an adequate treatment would lead us too far into complex analysis. The spe¬ 
cific difficulty here is that if the m's are allowed to be complex, then the a n 
will also be complex, and we do not assume that the reader is familiar with 
power series having complex coefficients. 

These ideas are formulated more precisely in the following theorem. 


Theorem A. Assume that x = 0 is a regular singular point of the differential equa¬ 
tion (1) and that the power series expansions (3) ofxP(x) and x 2 Q(x) are valid on an 
interval |x|<R zvith R> 0. Let the indicial equation (5) have real roots m 1 and m 2 
with m 2 < m v Then equation (1) has at least one solution 


i/i = (flo^O) (6) 

n =0 


on the interval 0 <x<R, where the a n are determined in terms ofa 0 by the recur slon 
formula (4) with m replaced by m v and the series ^^a„x" converges for |x|<R. 
Furthermore, ifm 1 -m 2 is not zero or a positive integer, then equation (1) has a sec¬ 
ond independent solution 


y 2 = x m2 ^a n x" (h 0 * 0) (7) 

n= 0 

on the same interval, zvhere in this case the a„ are determined in terms ofa 0 by for¬ 
mula (4) with m replaced by m 2 , and again the series ^ a n x n converges for |x|<R. 


In view of what we have already done, the proof of this theorem can be 
completed by showing that in each case the series a n x n converges on the 
interval |x|<R. Readers who are interested in the details of this argument 
will find them in Appendix A. We emphasize that in a specific problem it 
is much simpler to substitute the general Frobenius series (2) directly into 
the differential equation than to use the recursion formula (4) to calculate 
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the coefficients. This recursion formula finds its main application in the deli¬ 
cate convergence proof given in Appendix A. 

Theorem A unfortunately fails to answer the question of how to find a 
second solution when the difference m, - m 2 is zero or a positive integer. In 
order to convey an idea of the possibilities here, we distinguish three cases. 

CASE A. If m 1 -m 2 , there cannot exist a second Frobenius series solution. 

The other two cases, in both of which m 1 - m 2 is a positive integer, will be 
easier to grasp if we insert m = m 2 in the recursion formula (4) and write it as 


«/(m 2 + n) = -a 0 (m 2 p n + q,) - a n _ i[(m 2 + n-l)p 1 + q 1 ]. 


( 8 ) 


As we know, the difficulty in calculating the a n arises because f(m 2 +n) = 0 for 
a certain positive integer n. The next two cases deal with this problem. 

CASE B. If the right side of (8) is not zero when f(m 2 +n) = 0, then there is no 
possible way of continuing the calculation of the coefficients and there can¬ 
not exist a second Frobenius series solution. 

CASE C. If the right side of (8) happens to be zero when f(m 2 +n) = 0, then a n is 
unrestricted and can be assigned any value whatever. In particular, we can 
put a„ = 0 and continue to compute the coefficients without any further diffi¬ 
culties. Flence in this case there does exist a second Frobenius series solution. 

The problems below will demonstrate that each of these three possibilities 
actually occurs. 

The following calculations enable us to discover what form the second solu¬ 
tion takes when m 1 - m 2 is zero or a positive integer. We begin by defining a 
positive integer k by= m 1 - m 2 +1. The indicial equation (5) can be written as 


(m - m^)(m - m 2 ) = m 2 - (m l + m 2 )m + m 1 m 2 = 0, 


so equating the coefficients of m yields p 0 -l = -(m 1 + m 2 ) or m 2 =l-p 0 -m v 
and we have k = 2m 1 +p 0 . By using the method of Section 16, we can find a 
second solution y 2 from the known solution i/, = x m (a 0 + ape + • • •) by writing 
y 2 = vy v where 



1 


y i 

i 


f((po/x)+pi+--)dx 


x 2 ’" 1 (a 0 + apx + ■ ■ -) 2 


1 


(-pologx-pix—•) 


- £ 

x 2 ”' 1 ( fl 0 + fljX + • • -) 2 



= ( X ). 

X 
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The function g(x) defined by the last equality is clearly analytic at x = 0, with 
g( 0 ) = l/ a 2 , so in some interval about the origin we have 

g(x)-b 0 +b 1 x + b 2 x 2 + b 0 ^0. (9) 


It follows that 


v' = b 0 x~ k +b 1 x~ k+1 + ■ ■ ■ + b k _px~ l + b k + ■■■, 


so 


b 0 x k+1 byx k+2 

v = - + - + • • • + p t_i log x + b k x + ■ 

-k + 1 -k + 2 b 


and 


y 2 = y lV = XJx 


( b 0 x' k+1 


+ • • • + b k _ k logx + b k x + ■ 


v -k + 1 

= b k _ k yi log x + x "' 1 (fl 0 + «iX + • • •) 


r b 0 x- k+1 + ^ 

-k + 1 + "' 


If we factor x~ k+1 out of the series last written, use m 1 -k +1 = m 2 , and multiply 
the two remaining power series, then we obtain 


3/2 = &/C- 1 J /1 log x + x”' 2 Vc„x" 


( 10 ) 


n=o 


as our second solution. 

Formula (10) has only limited value as a practical tool; but it does yield 
several grains of information. First, if the exponents m 1 and m 2 are equal, 
then k= 1 and b k _ x -b 0 #0; so in this case—which is Case A above—the term 
containing log x is definitely present in the second solution (10). Flowever, 
if m 1 -m 2 = k-1 is a positive integer, then sometimes b k _ j#0 and the loga¬ 
rithmic term is present (Case B), and sometimes b k _ k - 0 and there is no loga¬ 
rithmic term (Case C). The practical difficulty here is that we cannot readily 
find b k _ k because we have no direct means of calculating the coefficients in 
(9). In any event, we at least know that in Cases A and B, when b k _ , # 0 and 
the method of Frobenius is only partly successful, the general form of a 
second solution is 


y 2 = 1/1 log x + x ” !2 ^ c„x”, 

n =0 


(li) 
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where the c„ are certain unknown constants that can be determined by sub¬ 
stituting (11) directly into the differential equation. Notice that this expres¬ 
sion is similar to formula 29-(10) but somewhat more complicated. 


Problems 

1. The equation 


x * 1 2 y" - 3 xy' + (4x + 4 )y = 0 


has only one Frobenius series solution. Find it. 

2 . The equation 

4x 2 y" - 8x 2 y' + (4x 2 + l)y = 0 


has only one Frobenius series solution. Find the general solution. 

3. Find two independent Frobenius series solutions of each of the follow¬ 
ing equations: 

(a) xy" + 2y'+xy = 0; 

(b) x 2 y"-x 2 y' + (x 2 -2)y = 0; 

(c) xy"-y' + 4x 3 4 i/ = 0. 

4. Bessel's equation of order p = 1 is 


x 2 y" + xy' + (x 2 - l)y = 0. 


Show that in t -m 2 = 2 and that the equation has only one Frobenius 
series solution. Then find it. 



Show that m 1 -m 2 = 1, but that nevertheless the equation has two inde¬ 
pendent Frobenius series solutions. Then find them. 

6 . The only Frobenius series solution of Bessel's equation of order p = 0 
is given in problem 29-5. By taking this as y v and substituting for¬ 
mula (11) into the differential equation, obtain the second independent 
solution 
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31 Gauss's Hypergeometric Equation 

This famous differential equation is 

x(l - x)y" + [c - (a + b +1 )x]y' - aby = 0, (1) 

where a, b, and c are constants. The coefficients of (1) may look rather strange, 
but we shall find that they are perfectly adapted to the use of its solutions in 
a wide variety of situations. The best way to understand this is to solve the 
equation for ourselves and see what happens. 

We have 


c-(«+b*l)x -«b 

x(l-x) x(l-x) 

so x = 0 and x = 1 are the only singular points on the x-axis. Also, 

xP(x) = -— + ^ + = [ c - (a + b + l)x](l + x + x 2 + ■■■) 

1-x 


and 


= c + [c-(a + b + l)]x + ■■■ 


x 2 Q(x) = a ^ X = -abx( 1 + x + x 2 + ■■■) 
1-x 


= -abx - abx 2 -, 


so x - 0 (and similarly x - 1) is a regular singular point. These expansion show 
that p 0 = c and q 0 = 0, so the indicial equation is 

m(m -l) + mc = 0 or m[m - (1 - c)] = 0 

and the exponents are /;/, =0 and m 2 - 1 - c. If 1 - c is not a positive integer, that 
is, if c is not zero or a negative integer, then Theorem 30-A guarantees that (1) 
has a solution of the form 


y = = a 0 + aix + a 2 x 2 + •••, (2) 

n =0 

where a 0 is a nonzero constant. On the substituting this into (1) and equating 
to zero the coefficient of x", we obtain the following recursion formula for 
the a n : 


®n+l 


(a + n)(b + n) 
(n+l)(c + n) 


(3) 
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We now set a n = 1 and calculate the other a„ in succession: 


«i = 


ab 

hc‘ 


a 2 = 


a(a + l)b(b + l) 

1 • 2c(c +1) ' 


«3 = 


a(a + 1 )(a + 2)b(b +l)(fr + 2) 
l-2-3c(c + l)(c + 2) ' 


With these coefficients, (2) becomes 


ab a(a + l)b(b + l) 2 

V = l +-x + —-—- -x 

J 1 • c 1 • 2c(c +1) 

, «(fl + l)(fl + 2)b(b + l)(b + 2) 3 | 

l-2-3c(c + l)(c + 2) 


n=i 


a(a + l)---(a + n-l)b(b + l)---(b + n-l) n 
n\c(c + l)---(c + n-l) 


(4) 


This is known as the hypergeometric series, and is denoted by the symbol 
F(a,b,c,x). It is called this because it generalizes the familiar geometric series 
as follows: when a = 1 and c = b, we obtain 

F(l, b, b, x) = 1 + x + x 2 + • • • =-. 

1-x 


If a or b is zero or a negative integer, the series (4) breaks off and is a poly¬ 
nomial; otherwise the ratio test shows that it converges for |x|< 1, since (3) 
gives 


«»+ iX n+1 


(a + n)(b + n) 

a„x n 


(n + \)(c + n) 


x —> x as n — > oo. 


This convergence behavior could also have been predicted from the fact that 
the singular point closest to the origin is x = 1. Accordingly, when c is not zero 
or a negative integer, F(a,b,c,x) is an analytic function—called the hypergeo¬ 
metric function —on the interval | x | < 1. It is the simplest particular solution 
of the hypergeometric equation. The hypergeometric function has a great 
many properties, of which the most obvious is that it is unaltered when a and 
b are interchanged: F(a,b,c,x) = F(b,a,c,x). 8 


A summary of some of its other properties can be found in A. Erdelyi (ed.). Higher 
Transcendental Functions, Vol. I, pp. 56-119, McGraw-Hill, New York, 1953. 
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If 1 - c is not zero or a negative integer—which means that c is not a posi¬ 
tive integer—then Theorem 30-A also tells us that there is a second indepen¬ 
dent solution of (1) near x = 0 with exponent m 2 = 1 -c. This solution can be 
found directly, by substituting 


y = x x ~ c (a 0 + ape + a 2 x 2 + ■ ■ •) 

into (1) and calculating the coefficients. It is more instructive, however, to 
change the dependent variable in (1) from y to z by writing 

y-x x ~ c z. 

When the necessary computations are performed—students should do this 
work themselves—equation (1) becomes 

x(l - x)z" + [(2 - c) - ([a - c + 1] + [b - c +1] + l)x]z' 

- (a - c +1 )(b - c + l)z=0, (5) 

which is the hypergeometric equation with the constants a, b, and c replaced 
by a - c +1, b - c +1, and 2 - c. We already know that (5) has the power series 
solution 


z = F(a - c + 1, b - c +1, 2 - c,x) 

near the origin, so our desired second solution is 

y = x 1-c F(a-c + l, b-c + 1, 2-c,x). 

Accordingly, when c is not an integer, we have 

y = c l F(a,b,c,x) + c 2 x x ~ c F(a -c +1, b-c + 1, 2 -c,x) (6) 

as the general solution of the hypergeometric equation near the singular 
point x = 0. 

In general, the above solution is only valid near the origin. We now solve 
(1) near the singular point x = 1. The simplest procedure is to obtain this solu¬ 
tion from the one already found, by introducing a new independent variable 
t = 1 - x. This makes x = 1 correspond to f = 0 and transforms (1) into 

f(l - t)y” + [(fl + b - c +1) - (a + b + \)t\y' - aby = 0, 

where the primes signify derivatives with respect to t. Since this is a hyper¬ 
geometric equation, its general solution near f = 0 can be written down at 
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once from (6), by replacing x by t and cby a + b-c + 1; and when t is replaced 
by 1 - x, we see that the general solution of (1) near x = 1 is 

y-c 1 F(a,b, a + b-c + 1,1 -x) 

+ c 2 (l - x) c ~ a ~ h F{c - b, c - a, c - a - b + 1,1 - x). (7) 

In this case it is necessary to assume that c-a-b is not an integer. 

Formulas (6) and (7) show that the adaptability of the constants in equation 
(1) makes it possible to express the general solution of this equation near each 
of its singular points in terms of the single function F. Much more than this 
is true, for these ideas are applicable to a wide class of differential equations. 
The key is to notice the following general features of the hypergeometric 
equation: that the coefficients of y", y', and y are polynomials of degrees 2, 
1, and 0, and also that the first of these polynomials has distinct real zeros. 
Any differential equation with these characteristics can be brought into the 
hypergeometric form by a linear change of the independent variable, and 
hence can be solved near its singular points in terms of the hypergeometric 
function. 

To make these remarks somewhat more concrete, we briefly consider the 
general equation of this type, 

(x-A)(x-B)y" + (C + Dx)i/' + £i/ = 0, (8) 

where A / B. If we change the independent variable from x to t by means of 


t = 


x-A 

B-A' 


then x= A corresponds to t - 0, and x = B to t = 1. With a little calculation, equa¬ 
tion (8) assumes the form 

f(l - t)y" + (F+ Gt)y' + Hy = 0, 

where F, G, and H are certain combinations of the constants in (8) and the 
primes indicate derivatives with respect to t. This is a hypergeometric equa¬ 
tion with a, b, and c defined by 

F = c, G - -(a + b + 1), H = -ab , 

and can therefore be solved near t = 0 and f = 1 in terms of the hypergeometric 
function. But this means that (8) can be solved in terms of the same function 
near x= A and x = B. 

The above ideas suggest the protean versatility of the hypergeometric func¬ 
tion F(a,b,c,x) in the field of differential equations. We will also see (in Problem 1) 
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that the flexibility afforded by the three constants a, b, and c allows the hyper¬ 
geometric function to include as special cases most of the familiar functions 
of elementary analysis. This function was known to Euler, who discovered a 
number of its properties; but it was first studied systematically in the context 
of the hypergeometric equation by Gauss, who in this connection gave the 
earliest satisfactory treatment of the convergence of an infinite series. Gauss's 
work was of great historical importance because it initiated far-reaching devel¬ 
opments in many branches of analysis—not only in infinite series, but also in 
the general theories of linear differential equations and functions of a complex 
variable. The hypergeometric function has retained its significance in modern 
mathematics because of its powerful unifying influence, since many of the 
principal special functions of higher analysis are also related to it. 9 


Problems 

1. Verify each of the following by examining the series expansions of the 
functions on the left sides: 


(a) (1 +x)P = F(-p,b,b,-x); 

(b) log(l+x)=xF(l,l,2, -x); 



It is also true that 




Satisfy yourself of the validity of these statements without attempting 
to justify the limit processes involved. 

2. Find the general solution of each of the following differential equations 
near the indicated singular point: 



9 A brief account of Gauss and his scientific work is given in Appendix C. 
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(b) (2x 2 + 2x)y" + (l+5x)y'+y = 0, x = 0; 

(c) (x 2 -l)y'' + (5x + 4)y'+4y = 0, x=-l; 

(d) (x 2 -x- 6 )y'' + (5 + 3x)y' + y = 0, x = 3. 

3. In Problem 28-6 we discussed Chebyshev's equation 
(1 - x 2 )y" - xy' + p 2 y = 0 , 

where p is a nonnegative constant. Transform it into a hypergeometric 

1 

equation by replacing x by t- —(1 - x), and show that its general solution 
near x = 1 is 


y = CiF p,-p, 


1 1 — x 


+ c 2 


1/2 


rl 1 13 1 — x 

f l P+ 2-- p+ 2'2'^ 


4. Consider the differential equation 

x(l - x)y" + [p - (p + 2 )x]y' - py= 0 , 
where p is a constant. 

(a) If p is not an integer, find the general solution near x = 0 in terms of 
hypergeometric functions. 

(b) Write the general solution found in (a) in terms of elementary 
functions. 

(c) When p = 1, the differential equation becomes 

x(l - x)y " + (1 - 3x)y' - y=0, 

and the solution in (b) is no longer the general solution. Find the gen¬ 
eral solution in this case by the method of Section 16. 

5. Some differential equations are of the hypergeometric type even 
though they may not appear to be so. Find the general solution of 

(l-e I )y" + |y' + e 1 y = 0 


near the singular point x = 0 by changing the independent variable to 
t = e x . 


tib 

6 . (a) Show that F'(a, b,c,x) = — F(a +1, b +1, c + l,x). 

c 

(b) By applying the differentiation formula in (a) to the result of 
Problem 3, show that the only solutions of Chebyshev's equation 

f i 1 — x 

whose derivatives are bounded near x = 1 are y = CiF 


Conclude that the only polynomial solutions of Chebyshev's equation 

( 1 1 ~x) 


are constant multiples of F 
integer. 


where n is a non-negative 
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The Chebyshev polynomial of degree n is denoted by T n (x ) and defined by 


T„(x)-F \ n,-n, 


1 1 — x 

2 '~ 2 ~ 


. An interesting application of these polyno¬ 


mials to the theory of approximation is discussed in Appendix D, 


32 The Point at Infinity 

It is often desirable, in both physics and pure mathematics, to study the solu¬ 
tions of 


y" + P(x)y’ + Q(x)y = 0 (1) 

for large values of the independent variable. For instance, if the variable is 
time, we may want to know how the physical system described by (1) behaves 
in the distant future, when transient disturbances have faded away 
We can adapt our previous ideas to this broader purpose by studying solu¬ 
tions near the point at infinity. The procedure is quite simple, for if we change 
the independent variable from x to 



( 2 ) 


then large x's correspond to small t's. Consequently, if we apply (2) to (1), 
solve the transformed equation near t-0, and then replace f by 1 lx in these 
solutions, we have solutions of (1) that are valid for large values of x. To carry 
out this program, we need the formulas 


and 


f = = d y dt = f _ A) = _ f 2 

dx dt dx dty x 2 ) dt 


V = 


d (dy) d (dy)dt 
dx\dx) dtidx) dx 



v 


d 2 y 

w 


■ It 


dy 

dt 


H 2 )- 


(3) 


(4) 


When these expressions are inserted in (1), and primes are used to denote 
derivatives with respect to f, then (1) becomes 


10 The notation T„(x) is used because Chebyshev's name was formerly transliterated as 
Tchebychev, Tchebycheff, or Tschebycheff. 
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y + 


2 P(l/Q 
t t 2 




(5) 


We say that equation (1) has i = “ asan ordinary point, a regular singular 
point with exponents m, and m 2 , or an irregular singular point, if the point 
f = 0 has the corresponding character for the transformed equation (5). 

As a simple illustration, consider the Euler equation 

y" + V + 4j/= 0- (6) 

X X 


A comparison of (6) with (5) shows that the transformed equation is 


y”-py' + h=°- 


(7) 


It is clear that t = 0 is a regular singular point for (7), with indicial equation 

m(m -1) - 2m + 2 = 0 


and exponents m x = 2 and m 2 = l. This means that (6) has x = °° as a regular 
singular point with exponents 2 and 1. 

Our main example is the hypergeometric equation 

x(l - x)y" + [c - (a + b + l)x]j/' -aby- 0. (8) 

We already know that (8) has two finite regular singular points: x = 0 with 
exponents 0 and 1 -c; and x = 1 with exponents 0 and c-a-b. To determine 
the nature of the point x = we substitute (3) and (4) directly into (8). After a 
little rearrangement, we find that the transformed equation is 


y + 


(l-a-b)-(2-c)t 

f(l-f) 


, ab 

V ^ -9- 

t (1-t) 


y = o. 


(9) 


This equation has t- 0 as a regular singular point with indicial equation 

m(m-l) + (l-a-b)m+ab-0 


or 


(m - a)(m -b) = 0. 

This shows that the exponents of equation (9) at t - 0 are a and b, so equation 
(8) has x = °»asa regular singular point with exponents a and b. We conclude 
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that the hypergeometric equation (8) has precisely three regular singular 
points: 0, 1, and °° with corresponding exponents 0 and 1 - c, 0 and c-a-b 
and a and b. In Appendix E we demonstrate that the form of the hypergeo¬ 
metric equation is completely determined by the specification of these three 
regular singular points together with the added requirement that at least one 
exponent must be zero at each of the points x - 0 and x=l. 

Another classical differential equation of considerable importance is the 
confluent hypergeometric equation 


xy" + (c-x)y'-ay= 0. (10) 

To understand where this equation comes from and why it bears this name, 
we consider the ordinary hypergeometric equation (8) in the form 

s(l - s) + [c - (a + b + l)s] — - aby = 0. (11) 

ds" ds 

If the independent variable is changed from s to x = bs, then we have 

dy dy dx , dy 
ds dx ds dx 


and 


d 2 y 


= b 


2 d y 
dx 2 ' 


and (11) becomes 




{c—x) — 


(a + \)x 


y’-ay = 0, 


( 12 ) 


where the primes denote derivatives with respect to x. Equation (12) has 
regular singular points at x = 0 ,x = b, and x = °°; it differs from (11) in that the 
singular point x-b is now mobile. If we let b -* then (12) becomes (10). The 
singular point at b has evidently coalesced with the one at and this conflu¬ 
ence of two regular singular points at °° is easily seen to produce an irregular 
singular point there (Problem 3). 


Problems 

1. Use (3) and (4) to determine the nature of the point x = °° for 

(a) Legendre's equation (1 - x 2 )y" - 2 xy' +p(p + T)y = 0; 

(b) Bessel's equation x 2 y" + xy' + ( x 2 - p 2 )y = 0. 
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2. Show that the change of dependent variable defined by y = t a w trans¬ 
forms equation (9) into the hypergeometric equation 

f(l - t)w" + {(1 + a - b) - [a + (1 +a - c) +1 ]t}w' 

- a(l + a - c)w - 0. 


If a and b are not equal and do not differ by an integer, conclude that the 
hypergeometric equation (8) has the following independent solutions 
for large values of x: 


yi = —F a,l + a-c,l + a-b, 


and 

y 2 =— t-F\ b,l + b-c,l + b-a, — 1 
x \ x J 

3. Verify that the confluent hypergeometric equation (10) has x = °° as an 
irregular singular point. 

4. Verify that the confluent hypergeometric equation (10) has x = 0 as a 
regular singular point with exponents 0 and 1 - c. If c is not zero or a 
negative integer, show that the Frobenius series solution corresponding 
to the exponent 0 is 


1 , a(a + l)---(a + n-l) ^ 
n!c(c + l)---(c + «-l) 

n=1 

The function defined by this series is known as the confluent hypergeo¬ 
metric function, and is often denoted by the symbol F(a,c,x). 

5. Laguerre's equation is 

xy" + (1 - x)y' +py=0, 

where p is a constant. 11 Use Problem 4 to show that the only solutions 
bounded near the origin are constant multiples of F(-p,l,x), and also 
that these solutions are polynomials if p is a nonnegative integer. The 
functions L n (x)-F(-n, l,x), where n = 0, 1, 2, ..., are called Laguerre poly¬ 
nomials ; they have important applications in the quantum mechanics of 
the hydrogen atom. 


11 Edmond Laguerre (1834-1886) was a professor at the College de France in Paris, and worked 
primarily in geometry and the theory of equations. He was one of the first to point out that a 
"reasonable" distance function (metric) can be imposed on the coordinate plane of analytic 
geometry in more than one way. 
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Appendix A. Two Convergence Proofs 

Proof of Theorem 28-A (conclusion). Our assumption is that the series 

OO 00 

P(x) = ^p n x n and Q(x)=^q n x n (1) 

n= 0 n= 0 

converge for | x \ < R, R > 0. We must prove that the series 

oo 

y = (2) 

n =0 

converges at least on the same interval if a 0 and a } are arbitrary and if a n+2 is 
defined recursively for n > 0 by 

n 

(n +1 )(n + 2 )a n+2 = - V[(fc + V)p n - k a k+ i + q n - k a k ] . (3) 

k =0 


Let r be a positive number such that r<R. Since the series (1) converge for 
x = r, and the terms of a convergent series approach zero and are therefore 
bounded, there exists a constant M > 0 such that 

| p n | r” < M and | q n \r n <M 
for all n. Using these inequalities in (3), we find that 

A r n 

(n + l)(n + 2)|fl„ +2 |< — ^[(k + l)\a k+1 \ + \a k \]r k 

r k =o 

A/T _ n 

<—^[(k + l)\a k+1 \ + \a k \]r k + M\a n+1 \r, 

r k =0 


where the term M | a n+1 \ r is inserted because it will be needed below. We now 
define b 0 = | a 0 \, b 1 = \a 1 \, and b n+2 (for n > 0) by 

A/T _ n 

(n +1 )(n + 2 )b n+2 = — ^[(fc + l)b k+1 + b k ]r k + Mb n+1 r. 
r k =o 


( 4 ) 
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It is clear that 0 <| a„ \ < b n for every n. We now try to learn something about the 
values of x for which the series 


( 5 ) 

n =0 

converges, and for this we need information about the behavior of the ratio 
b n+l /b n as n -> We acquire this information at follows. Replacing n in (4) 
first by n -1 and then by n - 2 yields 

a/t 

n(n +1 )b n+1 = + l)b k+1 + b k ]r k + Mb n r 

r k=0 


and 


(n - l)nb n = + b k]r k + Mb n ^r. 

r k =o 


By multiplying the first of these equations by r and using the second, we 
obtain 

A/T .I* 2 , 

m(n +1 )b n+1 = +1 )b k+1 + b k ]r k 

r k =o 

+ rM{nb„ + b n _ 1 ) + Mb„r 2 
= (n -1 )nb„ -Mb n _ir + rM(nb n + b,,^) + Mb n r 2 
- [(n-l)n + rMn + Mr 2 ]b„, 


so 


b n+ \ _ (n -1 )n + rMn + Mr 2 
b n rn(n + 1) 

This tells us that 

b n+ ix" +1 = tvn > 1 x 1 

b„x n b n r 

The series (5) therefore converges for | x\< r, so by the inequality \a n \<b n and 
the comparison test, the series (2) also converges for | x \ < r. Since r was an 
arbitrary positive number smaller than R, we conclude that (2) converges 
for |x|<R, and the proof is complete. 
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Proof of Theorem 30-A (conclusion). The argument is similar to that just 
given for Theorem 28-A, but is sufficiently different in its details to merit 
separate consideration. We assume that the series 


xP(x) = p n x” and x 2 Q(x) = q„x n (6) 

n =0 n =0 


converge for | x \ < R, R > 0. The indicial equation is 

/ (m) = m(m -1) + mp 0 + q 0 =Q, (7) 

and we consider only the case in which (7) has two real roots m x and m 2 with 
m 2 < m v The series whose convergence behavior we must examine is 


( 8 ) 

n=0 


where a 0 is an arbitrary nonzero constant and the other a n are defined recur¬ 
sively in terms of a 0 by 

n 1 

f(m + n)a„ =-'S\ k [{m+k)p n „ k + q n _ k \ . (9) 

k =0 


Our task is to prove that the series (8) converges for |x|<.R if m = m v and also 
if m = m 2 and m 1 - m 2 is not a positive integer. 

We begin by observing that/(wz) can be written in the form 

/ ( m) = (m - mfim - m 2 ) = m 2 - {rn r + mf)m + mpn^ 

With a little calculation, this enables us to write 

/ (m j + n)=n(n+m 1 - mf) 

and 


and consequently 


and 


f(m 2 + n)-n(n + m 2 - mf; 


\f(m 1 +n)\ >n(n-\m 1 -m 2 \) 


( 10 ) 


| f(m 2 + n) | > n(n - \ m 2 - m 1 \). 


( 11 ) 
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Let r be a positive number such that r<R. Since the series (6) converge for 
x = r, there exists a constant M > 0 with property that 

\p„\r n <M and \q n \ r n <M (12) 

for all n. If we put m=m 1 in (9) and use (10) and (12), we obtain 

n -1 - - 

n(n-\m 1 -m 2 1)| a n |<M^J-^j(| m 1 \ + k + l). 

k =0 V 


We now define a sequence {b n } by writing 

b„- \a n \ for 0 <n< \m 1 -m 2 


and 


n(n -1 m x - m 2 | )b n = M 


2Jsf(KI + * + 1 ) 


k =0 


(13) 


for n > | m 1 -m 2 \. It is clear that 0 < | a n \ < b u for every n. We shall prove that the 
series 


( 14 ) 

n =0 


converges for |x|<r, and to achieve this we seek a convenient expression for 
the ratio b n+1 /b n . By replacing n by n +1 in (13), multiplying by r, and using (13) 
to simplify the result, we obtain 

r(n +1 )(n +1 -1 m 1 -m 2 \) b n+1 

= n(n- -m 2 \)b n +Mb n (\m 1 \+ n + V), 


so 

b„+i _ n(n- 1 nil - |) + M(| nil \ +n + 1) 

b n r(n + l)(n+l-\mi~m 2 \) 

This tells us that 


bn+i% _ b n+ 1 | |_ | x 

b„x n b„ 


r 
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so (14) converges for |x|<r. It now follows from 0 < \a n \ < b n that (8) also con¬ 
verges for | x | < r; and since r was taken to be an arbitrary positive number 
smaller than R, we conclude that (8) converges for | x \ < R. If m t is everywhere 
replaced by m 2 and (11) is used instead of (10), then the same calculations 
prove that in this case the series (8) also converges for |x|<_R—assuming, 
of course, that m 1 -m 2 is not a positive integer so that the series (8) is well 
defined. 


Appendix B. Hermite Polynomials and Quantum Mechanics 

The most important single application of the Hermite polynomials is to the 
theory of the linear harmonic oscillator in quantum mechanics. A differ¬ 
ential equation that arises in this theory and is closely related to Hermite's 
equation (Problem 28-7) is 


d -^- + (2p + l-x 2 )zv = 0, (1) 

dx 

where p is a constant. For reasons discussed at the end of this appendix, 
physicists are interested only in solutions of (1) that approach zero as |x| -* 

If we try to solve (1) directly by power series, we get a three-term recursion 
formula for the coefficients, and this is too inconvenient to merit further con¬ 
sideration. To simplify the problem, we introduce a new dependent variable 
y by means of 


w = ye 


-x 2 /2 


( 2 ) 


This transforms (1) into 


^--2x^ + 2py = °, (3) 

dx dx 

which is Hermite's equation. The desired solutions of (1) therefore correspond 
to the solutions of (3) that grow in magnitude (as |x|—>°°) less rapidly than 
e x 1 , and we shall see that these are essentially the Hermite polynomials. 

Physicists motivate the transformation (2) by the following ingenious 
argument. When x is large, the constant Ip +1 in equation (1) is negligible 
compared with x 2 , so (1) is approximately 
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, 2 

It is not too outrageous to guess that the functions w = e x ' might be 
solutions of this equation. We now observe that 

w' = ±xe ±x2//2 and zv” = x 2 e ±x2 / 2 ± e** 2 / 2 ; 

and since for large x the second term of zv" can be neglected compared with 
the first, it appears that w = e x 2 and w = e ~ xl/2 are indeed "approximate solu¬ 
tions" of (1). The first of these is now discarded because it does not approach 
zero as |x| -> °°. It is therefore reasonable to suppose that the exact solution 
of (1) has the form (2), where we hope that the function y(x) has a simpler 
structure than w(x). 

Whatever one thinks of this reasoning, it works. For we have seen in 
Problem 28-7 that Hermite's equation (3) has a two-term recursion formula 


2 (p-n) 

n+2 (n + l)(n + 2) 


and also that this formula generates two independent series solutions 


yi(*)=i-§ 


x 2 + : 


Ap~ 2 ) x 4 2 p(p-2)(p-4) x6 | 
4! 6! 


(4) 


(5) 


and 


y 2 (x) = x- 2 ^x 
7 3! 5! 

2 3 (p-l)(p- 3)(p-5) 7 [ 

7! 


( 6 ) 


that converge for all x. 

We now compare the rates of growth of the functions i/,(x) and e x /2 . Our 
purpose is to prove that 


Vl(x) n I I 

J ^0 as xk® 

e P/2 I I 

if and only if the series for i/,(x) breaks off and is a polynomial, that is, if and 
only if the parameter p has one of the values 0,2,4,.... The "if" part is clear by 
THospital's rule. To prove the "only if" part, we assume that p / 0,2,4, ..., and 
show that in this case the above quotient does not approach zero. To do this. 
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we use the fact that yfx) has the form yfx) = f ainX 1 " with its coefficients 
determined by (4) and the condition a 0 = 1, and also that e x 2 has the series 
expansion e x ^ 2 = ^J) 2n x 2n where b 2n -l/(2 n n\), so 


yi(x) a 0 + a 2 x -ra^x + ■ ■ ■ + a 2n x +••• 
e * 2 / 2 b 0 + b 2 x 2 + fr 4 x 4 + • • • + b 2n x 2n + ■■■ 

Formula (4) tells us that all coefficients in the numerator with sufficiently 
large subscripts have the same sign, so without loss of generality these coef¬ 
ficients may be assumed to be positive. To prove that our quotient does not 
approach zero as |x| it therefore suffices to show that a 2n > b 2n if n is 

large enough. To establish this, we begin by observing that 

flin+2 _ _ 2(p —2n) b 2n+2 _ 1 

a 2n ~ (2n + l)(2n + 2) b 2n ~ 2{n + \)' 

so 

tiin+i/ain _ 2(p-2n)2(n + l) 

^2il+2 /b 2n (2n + l)(2n + 2) 

This implies that 


Cl2n+2 

bm+2 


3 fl2n 
2 b 2 n 


for all sufficiently large ns. If N is any one of these n's, then repeated applica¬ 
tion of this inequality shows that 


Cl2N+2k f 31 U2N ^ 

b2N+2k V 2 J b 2N 

for all sufficiently large k's, so a 2 Jb 2n > 1 or a 2n >b 2n if n is large enough. 
The above argument proves that yfx)e~ x ri 0 as |x| °° if and only if 

the parameter p has one of the values 0,2,4, .... Similar reasoning yields the 
same conclusion for y 2 (x)e~ x 12 (with p = 1 , 3 , 5 , ...), so the desired solutions of 
Hermite's equation are constant multiples of the Hermite polynomials H 0 (x), 
Hfx), H 2 (x), ... defined in Problem 28-7. 
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The generating function and Rodrigues' formula. We have seen how the 
Hermite polynomials arise, and we now turn to a consideration of their most 
useful properties. The significance of these properties will become clear at 
the end of this appendix. 

These polynomials are often defined by means of the following power 
series expansion: 

e 2xt -‘ 2 = = H 0 (x) + H 1 (x)t + ^p-t 2 +---. (7) 


The function e 2xt is called the generating function of the Hermite polynomi¬ 
als. This definition has the advantage of efficiency for deducing properties of 
the H n (x), and the obvious weakness of being totally unmotivated. We shall 
therefore derive (7) from the series solutions (5) and (6). 

All polynomial solutions of (3) are obtained from these series by replacing 
p by an integer n > 0 and multiplying by an arbitrary constant. They all have 
the form 


h n (x) — * ■ • + a n ^^x a n - 4 X + a n _ 2 % ~^~a n x 
= a n x n + a„_ 2 x n ~ 2 + a n - 4 x n ~ 4 + a„_ 6 x"~ 6 + • • 


where the sum last written ends with a 0 or ape according as n is even or odd 
and its coefficients are related by 


a k+2 ~ ~ 


2 (n-k) 

{k + l)(k + 2 ) k ' 


( 8 ) 


We shall find a n _ 2 , a n _ 4 ,... in terms of a n , and to this end we replace k in (8) by 
k -2 and get 


o-k — — 


2 (n-k + 2 ) 

(*-l )k 


ilk-2 


or 


k(k-F) 
2 (n-k + 2 ) 


a k . 


a k-2 - ~ 
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Letting k be n, n - 2, n - 4, etc., yields 


7 

U n -2 — - ———“nr 

2-2 

_ (n - 2)(n - 3) 

a ,,-4 --—:-“ h -2 

2-4 

n(n-l)(n-2)(n-3) 
“ 2 2 • 2 ■ 4 

_ (n-4)(n-5) 

U n -6 — -- a n-4 

2-6 


n(n - 1 )(n - 2)(n - 3 )(n - 4 )(n - 5) 
2 3 -2-4-6 


d n , 


and so on, so 


h n (x) = a n [x 


n n(n-l) x „_ 2 { n(n-l)(n-2)(n-3) x „_ 4 
2-2 2 2 • 2•4 


n(n -1 ){n -2)(n- 3 )(n - 4)(n - 5) ^»_ 6 + 
2' • 2 • 4 • 6 


+ ( -_^/c n(n-l)-(n-2k + l) x n- 2 k + _ 


2 •2-4---(2fc) 
This expression can be written in the form 


[n/2] 


h 


„(*) = «„£(-!)* 


n\ 


n-2k 


2 lk k\{n-2k)\ 


k=0 


where [n/2] is the standard notation for the greatest integer < n/2. To get the 
nth Hermite polynomial H„(x), we put a n = 2" and obtain 


In/2] 


H n (x) = ^(-If 


n\ 


k =o 


kl(n-2k)\ 


(2x) n 


( 9 ) 


This choice for the value of a n is purely a matter of convenience; it has the 
effect of simplifying the formulas expressing the various properties of the 
Hermite polynomials. 

In order to make the transition from (9) to (7), we digress briefly. The defin¬ 
ing formula for the product of two power series. 


. oo x 


oo n 


^ ~ji n t n ^ \ t t ” 'y'fkb 

V n =0 J v n =0 J n =0 \ n=0 ) 


n-k 


t n 
*- / 
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is awkward to use when the first series contains only even powers of f: 




V h=o y v n =o 


- 7 


What we want to do here is gather together the nth powers of t from all pos¬ 
sible products a k t 2k bjti, so 2 k +j = n and the terms we consider are a k t 2k b n _ 2k t n ~ 2k . 
The restrictions are k > 0 and n - 2k > 0, so 0 < k < n/2; and for each n > 0 we 
see that k varies from 0 to the greatest integer < n/2. This yields the product 
formula 




V n =0 J V «=0 J 


oo ([n/2] 

z 

n =0 V 


L”/ *-J 

^^a k b n - 2 k 


t n . 


( 10 ) 


If we now insert (9) into the right side of (7) and use (10), we obtain 


n \ Lu Lu 


n =0 


n =0 k =0 


(-1)^2^)" 

k\(n-2k)\ 


t n 


y(-i)" ,2,Jy(2xr ' 
^ n! n\ 

n=0 n=0 


Z4r I 

n =0 _ _ n =0 


i (2 xty 


n\ 


= e -‘e 2xt =e 2xt - t 


which establishes (7). 

As an application of (7) we prove Rodrigues' formula for the Hermite 
polynomials: 


H„(x) = (-l)V' 2 


( 11 ) 


In view of formula 26-(9) for the coefficients of a power series, (7) yields 


H„(x) = 


r d n ■ 
— e‘ 
dt n 


= e 


/f=0 


<r_ 

cT 




h=o 
































256 


Differential Equations with Applications and Historical Notes 


If we introduce a new variable z-x-t and use the fact that d/dt = -(d/dz), then 
since t - 0 corresponds to z-x, the expression last written becomes 


(-1)V 


d n 

dz n 


A=3 


, 1V! x 2 d” - 

= (-l)'e - e 

K ' dx n 


and the proof is complete. 


Orthogonality. We know that for each nonnegative integer n the function 

w n (x) = e~^ /2 H n (x), (12) 

called the Hermite function of order n, approaches zero as \x |-> °° and is a solu¬ 
tion of the differential equation 

w"„ +(2n +l-x 2 )w n = 0. (13) 

An important property of these functions is the fact that 


1 


w m w n dx 


—co 



H m (x)H n (x) dx = 0 


—co 


if rn n. 


(14) 


This relation is often expressed by saying that the Hermite functions are 
orthogonal on the interval (-°°, °°). 

To prove (14) we begin by writing down the equation satisfied by w m (x), 

w" m + (2m + l-x 2 )w m =0. (15) 

Now, multiplying (13) by w m and (15) by w n and subtracting, we obtain 

—(w'„w m - w' m w n ) + 2 (n - m)w m w n = 0. 
dx 

If we integrate this equation from to °° and use the fact that w’„w m - w’ m w n 
vanishes at both limits, we see that 


2 (n- 


oo 


w m w n dx = 0, 


which implies (14) 
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We will also need to know that the value of the integral in (14) when m = n is 

00 

je~ x2 [H„(x)] 2 dx = Tn\yfn. (16) 

—oo 

To establish this, we use Rodrigues' formula (11) and integrate 

oo oo 

je~ x2 H n (x)H n (x)dx = (-1 r^H n (x)^e- xl dx 


by parts, with 


u = H n (x), du = H' n (x) dx, 

, d" d" -1 

ai7 =- e dx, v = - T e ' . 

dx n dx"- 1 


Since uv is the product of e 1 and a polynomial, it vanishes at both limits and 

00 00 n -1 

j> 2 [H„(x)] 2 dx = (-1)” +1 Jh,' ( X )£^e- x2 dx 
—00 —00 

oo ^_2 

= (-1)" +2 j" H"„(x) j^n -2 e~ x2 dx 

—oo 

oo 

= ••■ = (-! f n J H ( „ n) (x)e- xl dx. 


Now the term containing the highest power of x in H„(x) is 2"x", so 
H^"\x) = 2 "n ! and the last integral is 


oo oo 

l"n\ je~ x2 dx = (2"n!)2je- x2 dx = Tn'.Jn, 


which is the desired result. 1 


12 The fact that the integral of e x from 0 to “ is 4n/2 is often proved in elementary calculus. 
See Equation 3 in Appendix 1A. 
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These orthogonality properties can be used to expand an "arbitrary" func¬ 
tion/^) in a Hermite series: 


f(x) = J^a n H n (x). (17) 

n=0 


If we proceed formally, the coefficients a n can be found by multiplying 
(17) by e~ x H m (x) and integrating term by term from to By (14) and 
(16) this gives 


oo oo 00 

je^' 2 H m (x)f(x)dx = ^Ta„ je~ x ~H m (x)H n (x)dx = a m T'm\yfa, 

—oo n=0 _ co 


so (replacing m by n) 


(in = — ] r x2 H n (x)f(x)dx. 
1 n\\Jn J 


(18) 


This formal procedure suggests the mathematical problem of determin¬ 
ing conditions on the function/ (x) that guarantee that (17) is valid when 
the a n ‘ s are defined by (18). Problems of this kind are part of the general 
theory of orthogonal functions. Some direct physical applications of 
orthogonal expansions like (17) are discussed in Appendices A and B of 
Chapter 8. 


The harmonic oscillator. As we stated at the beginning, the mathematical 
ideas developed above have their main application in quantum mechan¬ 
ics. An adequate discussion of the underlying physical concepts is clearly 
beyond the scope of this appendix. Nevertheless, it is quite easy to under¬ 
stand the role played by the Hermite polynomials H n (x) and the correspond¬ 
ing Hermite functions e~ x /2 H n (x). 

In Section 20 we analyzed the classical harmonic oscillator, which can be 
thought of as a particle of mass m constrained to move along the x-axis and 
bound to the equilibrium position x = 0 by a restoring force -kx. The equation 
of motion is 


m 


d 2 x 

dt 2 


= -kx; 
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and with suitable initial conditions, we found that its solution is the har¬ 
monic oscillation 


X = x 0 cos 



where x 0 is the amplitude. We also recall that the period T is given by 

T = In^Jm/k ; and since the vibrational frequency v is the reciprocal of the 

period, we have k = An 2 mv 2 . Furthermore, since the kinetic energy is —m(dx/ 

1 ^ 
dt) 2 and the potential energy is ^kx 2 , an easy calculation shows that the total 

1 2 

energy of the system is E = ^ /ay, a constant. This total energy may clearly 
take any positive value whatever. 

In quantum mechanics, the Schrodinger wave equation for the harmonic 
oscillator described above is 


d 2 \\i 8n 2 m 
dx 2 h 2 



\|/ = 0 , 


(19) 


where £ is again the total energy, h is Planck's constant, and satisfactory solu¬ 
tions \|/(x) are known as Schrodinger wave functions} 3 If we use the equation 
k=An 2 mv 2 to eliminate the force constant k, then (19) can be written in the 
form 


d \|/ 871 Ul . 2 2 2\ n 

—-£- + —s— (E-2n mv x )\|/ = 0. (20) 

dx~ h 

The physically admissible (or "civilized") solutions of this equation are those 
satisfying the conditions 


\\i —» 0 as | x \—> co and J | \|/1 2 dx = 1. (21) 

—oo 

These solutions—the Schrodinger wave functions—are also called the eigen¬ 
functions of the problem, and we shall see that they exist only when £ has 
certain special values called eigenvalues. 


13 Erwin Schrodinger (1887-1961) was an Austrian theoretical physicist who shared the 1933 
Nobel Prize with Dirac. His scientific work can be appreciated only by experts, but he was 
a man of broad cultural interests and was a brilliant and lucid writer in the tradition of 
Poincare. He liked to write pregnant little books on big themes: What Is Life?, Science and 
Humanism, Nature and the Greeks, Cambridge University Press, New York, 1944, 1952, 1954, 
respectively. 
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If we change the independent variable to 



then (20) becomes 


d 2 w (2E 2 

—T +- 11 

du \hv 


i|/ = 0 


and conditions (21) become 



—oo 


( 22 ) 


(23) 


(24) 


Except for notation, equation (23) has exactly the form of equation (1), so we 
know that it has solutions satisfying the first condition of (24) if and only if 
2E/lw=2n + l or 


E = hv^n + ^j (25) 

for some non-negative integer n. We also know that in this case these solu¬ 
tions of (23) have the form 


\\i = ce l,1/1 H n (ii) 

where c is a constant. If we now impose the second condition of (24) and use 
(16), then it follows that 


c = 


Anvm 
2 2n {n\fh 


1/4 


The eigenfunction corresponding to the eigenvalue (25) is therefore 


V = 


Anvm 
2 2n (n\) 2 h 


1/4 


-u 2 / 2 


H n (u), 


(26) 


where (22) gives u in terms of x. 
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Physicists have a deep professional interest in the detailed properties of 
these eigenfunctions. For us, however, the problem is only an illustration 
of the occurrence of the Hermite polynomials, so we will not pursue the 
matter any further—beyond pointing out that formula (25) yields the so- 
called quantized energy levels of the harmonic oscillator. This means that 
the energy E may assume only these discrete values, which of course is 
very different from the corresponding classical situation described above. 
The simplest concrete application of these ideas is to the vibrational motion 
of the atoms in a diatomic molecule. When this phenomenon is studied 
experimentally, the observed energies are found to be precisely in accord 
with (25). 


NOTE ON HERMITE. Charles Hermite (1822-1901), one of the most emi¬ 
nent French mathematicians of the nineteenth century, was particularly 
distinguished for the elegance and high artistic quality of his work. As a 
student, he courted disaster by neglecting his routine assigned work to study 
the classic masters of mathematics; and though he nearly failed his examina¬ 
tions, he became a first-rate creative mathematician himself while still in his 
early twenties. In 1870 he was appointed to a professorship at the Sorbonne, 
where he trained a whole generation of well known French mathematicians, 
including Picard, Borel, and Poincare. 

The unusual character of his mind is suggested by the following remark 
of Poincare: "Talk with M. Hermite. He never evokes a concrete image, yet 
you soon perceive that the most abstract entities are to him like living crea¬ 
tures." He disliked geometry, but was strongly attracted to number theory 
and analysis, and his favorite subject was elliptic functions, where these 
two fields touch in many remarkable ways. The reader may be aware that 
Abel had proved many years before that the general polynomial equation 
of the fifth degree cannot be solved by functions involving only rational 
operations and root extractions. One of Hermite's most surprising achieve¬ 
ments (in 1858) was to show that this equation can be solved by elliptic 
functions. His 1873 proof of the transcendence of e was another high point 
of his career. 

Several of his purely mathematical discoveries had unexpected appli¬ 
cations many years later to mathematical physics. For example, the 
Hermitian forms and matrices he invented in connection with certain 
problems of number theory turned out to be crucial for Heisenberg's 
1925 formulation of quantum mechanics, and we have seen that Hermite 
polynomials and Hermite functions are useful in solving Schrodinger's 
wave equation. The reason is not clear, but it seems to be true that math¬ 
ematicians do some of their most valuable practical work when thinking 
about problems that appear to have nothing whatever to do with physical 
reality. 
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Appendix C. Gauss 

Carl Friedrich Gauss (1777-1855) was the greatest of all mathematicians 
and perhaps the most richly gifted genius of whom there is any record. 
This gigantic figure, towering at the beginning of the nineteenth century, 
separates the modern era in mathematics from all that went before. His 
visionary insight and originality, the extraordinary range and depth of his 
achievements, his repeated demonstrations of almost superhuman power 
and tenacity—all these qualities combined in a single individual present an 
enigma as baffling to us as it was to his contemporaries. 

Gauss was born in the city of Brunswick in northern Germany. His 
exceptional skill with numbers was clear at a very early age, and in later 
life he joked that he knew how to count before he could talk. It is said that 
Goethe wrote and directed little plays for a puppet theater when he was 
six, and that Mozart composed his first childish minuets when he was five, 
but Gauss corrected an error in his father's payroll accounts at the age of 
three. 14 His father was a gardener and bricklayer without either the means 
or the inclination to help develop the talents of his son. Fortunately, how¬ 
ever, Gauss's remarkable abilities in mental computation attracted the inter¬ 
est of several influential men in the community, and eventually brought 
him to the attention of the Duke of Brunswick. The Duke was impressed 
with the boy and undertook to support his further education, first at the 
Caroline College in Brunswick (1792-1795) and later at the University of 
Gottingen (1795-1798). 

At the Caroline College, Gauss completed his mastery of the classical lan¬ 
guages and explored the works of Newton, Euler, and Lagrange. Early in 
this period—perhaps at the age of fourteen or fifteen—he discovered the 
prime number theorem, which was finally proved in 1896 after great efforts 
by many mathematicians (see our notes on Chebyshev and Riemann). He 
also invented the method of least squares for minimizing the errors inherent 
in observational data, and conceived the Gaussian (or normal) law of distri¬ 
bution in the theory of probability. 

At the university. Gauss was attracted by philology but repelled by the 
mathematics courses, and for a time the direction of his future was uncer¬ 
tain. However, at the age of eighteen he made a wonderful geometric discov¬ 
ery that caused him to decide in favor of mathematics and gave him great 
pleasure to the end of his life. The ancient Greeks had known ruler-and- 
compass constructions for regular polygons of 3,4,5, and 15 sides, and for all 
others obtainable from these by bisecting angles. But this was all, and there 
the matter rested for 2000 years, until Gauss solved the problem completely. 


14 See W. Sartorius von Waltershausen, "Gauss zum Gedachtniss." These personal recollec¬ 
tions appeared in 1856, and a translation by Helen W. Gauss (the mathematician's great- 
granddaughter) was privately printed in Colorado Springs in 1966. 
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He proved that a regular polygon with n sides is constructible if and only 
if n is the product of a power of 2 and distinct prime numbers of the form 
p k = 2 2 +1. In particular, when k = 0,1,2,3, we see that each of the correspond¬ 
ing numbers p k = 3,5,17,257 is prime, so regular polygons with these numbers 
of sides are constructible. 15 

During these years Gauss was almost overwhelmed by the torrent of ideas 
which flooded his mind. He began the brief notes of his scientific diary in an 
effort to record his discoveries, since there were far too many to work out in 
detail at that time. The first entry, dated March 30,1796, states the construct- 
ibility of the regular polygon with 17 sides, but even earlier than this he was 
penetrating deeply into several unexplored continents in the theory of num¬ 
bers. In 1795 he discovered the law of quadratic reciprocity, and as he later 
wrote, "For a whole year this theorem tormented me and absorbed my great¬ 
est efforts, until at last I found a proof," 16 At that time Gauss was unaware 
that the theorem had already been imperfectly stated without proof by Euler, 
and correctly stated with an incorrect proof by Legendre. It is the core of the 
central part of his famous treatise Disqidsitiones Arithmeticae, which was pub¬ 
lished in 1801 although completed in 1798! 7 Apart from a few fragmentary 
results of earlier mathematicians, this great work was wholly original. It is 
usually considered to mark the true beginning of modern number theory, to 
which it is related in much the same way as Newton's Principia is to phys¬ 
ics and astronomy. In the introductory pages Gauss develops his method of 
congruences for the study of divisibility problems and gives the first proof of 
the fundamental theorem of arithmetic (also called the unique factorization 
theorem), which asserts that every integer n > 1 can be expressed uniquely as 
a product of primes. The central part is devoted mainly to quadratic congru¬ 
ences, forms, and residues. The last section presents his complete theory of 
the cyclotomic (circle-dividing) equation, with its applications to the con- 
structibility of regular polygons. The entire work was a gargantuan feast of 
pure mathematics, which his successors were able to digest only slowly and 
with difficulty. 

In his Disqidsitiones Gauss also created the modern rigorous approach to 
mathematics. He had become thoroughly impatient with the loose writing 
and sloppy proofs of his predecessors, and resolved that his own works 
would be beyond criticism in this respect. As he wrote to a friend, "I mean 
the word proof not in the sense of the lawyers, who set two half proofs equal 
to a whole one, but in the sense of the mathematician, where 1/2 proof = 0 
and it is demanded for proof that every doubt becomes impossible." The 
Disqidsitiones was composed in this spirit and in Gauss's mature style, which 


15 Details of some of these constructions are given in H. Tietze, Famous Problems of Mathematics, 
chap. IX, Graylock Press, New York, 1965. 

16 See D. W. Smith, A Source Book in Mathematics, pp. 112-118, McGraw-Hill, New York, 1929. 
This selection includes a statement of the theorem and the fifth of eight proofs that Gauss 
found over a period of many years. There are probably more than 50 known today. 

17 There is a translation by Arthur A. Clarke (Yale University Press, New Haven, Conn., 1966). 
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is terse, rigorous, devoid of motivation, and in many places so carefully pol¬ 
ished that it is almost unintelligible. In another letter he said, "You know that 
I write slowly This is chiefly because I am never satisfied until I have said 
as much as possible in a few words, and writing briefly takes far more time 
than writing at length." One of the effects of this habit is that his publica¬ 
tions concealed almost as much as they revealed, for he worked very hard 
at removing every trace of the train of thought that led him to his discover¬ 
ies. Abel remarked, "He is like the fox, who effaces his tracks in the sand 
with his tail." Gauss replied to such criticisms by saying that no self-respect¬ 
ing architect leaves the scaffolding in place after completing his building. 
Nevertheless, the difficulty of reading his works greatly hindered the diffu¬ 
sion of his ideas. 

Gauss's doctoral dissertation (1799) was another milestone in the history 
of mathematics. After several abortive attempts by earlier mathematicians— 
d'Alembert, Euler, Lagrange, Laplace—the fundamental theorem of algebra 
was here given its first satisfactory proof. This theorem asserts the existence 
of a real or complex root for any polynomial equation with real or complex 
coefficients. Gauss's success inaugurated the age of existence proofs, which 
ever since have played an important part in pure mathematics. Lurthermore, 
in this first proof (he gave four altogether) Gauss appears as the earliest 
mathematician to use complex numbers and the geometry of the complex 
plane with complete confidence. 18 

The next period of Gauss's life was heavily weighted toward applied math¬ 
ematics, and with a few exceptions the great wealth of ideas in his diary and 
notebooks lay in suspended animation. 

In the last decades of the eighteenth century, many astronomers were 
searching for a new planet between the orbits of Mars and Jupiter, where 
Bode's law (1772) suggested that there ought to be one. The first and largest 
of the numerous minor planets known as asteroids was discovered in that 
region in 1801, and was named Ceres. This discovery ironically coincided 
with an astonishing publication by the philosopher Hegel, who jeered at 
astronomers for ignoring philosophy: this science (he said) could have saved 
them from wasting their efforts by demonstrating that no new planet could 
possibly exist. 19 Hegel continued his career in a similar vein, and later rose 
to even greater heights of clumsy obfuscation. Unfortunately the tiny new 
planet was difficult to see under the best of circumstances, and it was soon 
lost in the light of the sky near the sun. The sparse observational data posed 
the problem of calculating the orbit with sufficient accuracy to locate Ceres 
again after it had moved away from the sun. The astronomers of Europe 
attempted this task without success for many months. Linally, Gauss was 


18 The idea of this proof is very clearly explained by F. Klein, Elementary Mathematics from an 
Advanced Standpoint, pp. 101-104, Dover, New York, 1945. 

19 See the last few pages of "De Orbitis Planetarum," vol. I of Georg Wilhelm Hegel's Samtliche 
Werke, Frommann Verlag, Stuttgart, 1965. 
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attracted by the challenge; and with the aid of his method of least squares 
and his unparalleled skill at numerical computation he determined the 
orbit, told the astronomers where to look with their telescopes, and there 
it was. He had succeeded in rediscovering Ceres after all the experts had 
failed. 

This achievement brought him fame, an increase in his pension from 
the Duke, and in 1807 an appointment as professor of astronomy and first 
director of the new observatory at Gottingen. He carried out his duties 
with his customary thoroughness, but as it turned out, he disliked admin¬ 
istrative chores, committee meetings, and all the tedious red tape involved 
in the business of being a professor. He also had little enthusiasm for teach¬ 
ing, which he regarded as a waste of his time and as essentially useless 
(for different reasons) for both talented and untalented students. However, 
when teaching was unavoidable he apparently did it superbly. One of his 
students was the eminent algebraist Richard Dedekind, for whom Gauss's 
lectures after the passage of 50 years remained "unforgettable in memory 
as among the finest which I have ever heard." 20 Gauss had many opportu¬ 
nities to leave Gottingen, but he refused all offers and remained there for 
the rest of his life, living quietly and simply, traveling rarely, and working 
with immense energy on a wide variety of problems in mathematics and 
its applications. Apart from science and his family—he had two wives and 
six children, two of whom emigrated to America—his main interests were 
history and world literature, international politics, and public finance. He 
owned a large library of about 6000 volumes in many languages, includ¬ 
ing Greek, Latin, English, French, Russian, Danish, and of course German. 
His acuteness in handling his own financial affairs is shown by the fact 
that although he started with virtually nothing, he left an estate over a 
hundred times as great as his average annual income during the last half 
of his life. 

In the first two decades of the nineteenth century Gauss produced a steady 
stream of works on astronomical subjects, of which the most important was 
the treatise Theoria Motus Corporum Coelestium (1809). This remained the 
bible of planetary astronomers for over a century. Its methods for dealing 
with perturbations later led to the discovery of Neptune. Gauss thought of 
astronomy as his profession and pure mathematics as his recreation, and 
from time to time he published a few of the fruits of his private research. His 
great work on the hypergeometric series (1812) belongs to this period. This 
was a typical Gaussian effort, packed with new ideas in analysis that have 
kept mathematicians busy ever since. 


20 Dedekind's detailed recollections of this course are given in G. Waldo Dunnington, Carl 
Friedrich Gauss: Titan of Science, pp. 259-261, Hafner, New York, 1955. This book is useful 
mainly for its many quotations, its bibliography of Gauss's publications, and its list of the 
courses he offered (but often did not teach) from 1808 to 1854. 
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Around 1820 he was asked by the government of Hanover to supervise a 
geodetic survey of the kingdom, and various aspects of this task—includ¬ 
ing extensive field work and many tedious triangulations—occupied him 
for a number of years. It is natural to suppose that a mind like his would 
have been wasted on such an assignment, but the great ideas of science are 
born in many strange ways. These apparently unrewarding labors resulted 
in one of his deepest and most far-reaching contributions to pure mathemat¬ 
ics, without which Einstein's general theory of relativity would have been 
quite impossible. 

Gauss's geodetic work was concerned with the precise measurement of 
large triangles on the earth's surface. This provided the stimulus that led 
him to the ideas of his paper Disquisitiones generates circa superficies curvas 
(1827), in which he founded the intrinsic differential geometry of general 
curved surfaces. 21 In this work he introduced curvilinear coordinates u and 
v on a surface; he obtained the fundamental quadratic differential form 
ds 2 = E du 2 + 2F du dv + G dv 2 for the element of arc length ds, which makes 
it possible to determine geodesic curves; and he formulated the concepts 
of Gaussian curvature and integral curvature. 22 His main specific results 
were the famous theorema egregium, which states that the Gaussian curva¬ 
ture depends only on £, F, and G, and is therefore invariant under bend¬ 
ing; and the Gauss-Bonnet theorem on integral curvature for the case of 
a geodesic triangle, which in its general form is the central fact of mod¬ 
ern differential geometry in the large. Apart from his detailed discoveries, 
the crux of Gauss's insight lies in the word intrinsic, for he showed how to 
study the geometry of a surface by operating only on the surface itself and 
paying no attention to the surrounding space in which it lies. To make this 
more concrete, let us imagine an intelligent two-dimensional creature who 
inhabits a surface but has no awareness of a third dimension or of anything 
not on the surface. If this creature is capable of moving about, measuring 
distances along the surface, and determining the shortest path (geodesic) 
from one point to another, then he is also capable of measuring the Gaussian 
curvature at any point and of creating a rich geometry on the surface—and 
this geometry will be Euclidean (flat) if and only if the Gaussian curvature 
is everywhere zero. When these conceptions are generalized to more than 
two dimensions, then they open the door to Riemannian geometry, tensor 
analysis, and the ideas of Einstein. 

Another great work of this period was his 1831 paper on biquadratic resi¬ 
dues. Here he extended some of his early discoveries in number theory with 
the aid of a new method, his purely algebraic approach to complex numbers. 
He defined these numbers as ordered pairs of real numbers with suitable 


21 A translation by A. Hiltebeitel and J. Morehead was published under the title General 
Investigations of Curved Surfaces by the Raven Press, Hewlett, New York, in 1965. 

22 These ideas are explained in nontechnical language in C. Lanczos, Albert Einstein and the 
Cosmic World Order, chap. 4, Interscience-Wiley, New York, 1965. 
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definitions for the algebraic operations, and in so doing laid to rest the con¬ 
fusion that still surrounded the subject and prepared the way for the later 
algebra and geometry of n-dimensional spaces. But this was only inciden¬ 
tal to his main purpose, which was to broaden the ideas of number theory 
into the complex domain. He defined complex integers (now called Gaussian 
integers) as complex numbers a + ib with a and b ordinary integers; he intro¬ 
duced a new concept of prime numbers, in which 3 remains prime but 
5 = (1 + 21) • (1 - 2 i) does not; and he proved the unique factorization theorem 
for these integers and primes. The ideas of this paper inaugurated algebraic 
number theory, which has grown steadily from that day to this. 23 

From the 1830s on. Gauss was increasingly occupied with physics, and he 
enriched every branch of the subject he touched. In the theory of surface 
tension, he developed the fundamental idea of conservation of energy and 
solved the earliest problem in the calculus of variations involving a double 
integral with variable limits. In optics, he introduced the concept of the focal 
length of a system of lenses and invented the Gauss wide-angle lens (which 
is relatively free of chromatic aberration) for telescope and camera objec¬ 
tives. He virtually created the science of geomagnetism, and in collabora¬ 
tion with his friend and colleague Wilhelm Weber he built and operated an 
iron-free magnetic observatory, founded the Magnetic Union for collecting 
and publishing observations from many places in the world, and invented 
the electromagnetic telegraph and the bifilar magnetometer. There are many 
references to his work in James Clerk Maxwell's famous Treatise on Electricity 
and Magnetism (1873). In his preface. Maxwell says that Gauss "brought his 
powerful intellect to bear on the theory of magnetism and on the methods of 
observing it, and he not only added greatly to our knowledge of the theory 
of attractions, but reconstructed the whole of magnetic science as regards 
the instruments used, the methods of observation, and the calculation of 
results, so that his memoirs on Terrestrial Magnetism may be taken as mod¬ 
els of physical research by all those who are engaged in the measurement of 
any of the forces in nature." In 1839 Gauss published his fundamental paper 
on the general theory of inverse square forces, which established potential 
theory as a coherent branch of mathematics. 24 As usual, he had been think¬ 
ing about these matters for many years; and among his discoveries were the 
divergence theorem (also called Gauss's theorem) of modern vector analysis, 
the basic mean value theorem for harmonic functions, and the very power¬ 
ful statement which later became known as "Dirichlet's principle" and was 
finally proved by Hilbert in 1899. 

We have discussed the published portion of Gauss's total achievement, but 
the unpublished and private part was almost equally impressive. Much of 


23 See E. T. Bell, "Gauss and the Early Development of Algebraic Numbers," National Math. 
Mag., vol. 18, pp. 188-204, 219-233 (1944). 

24 George Green's "Essay on the Application of Mathematical Analysis to the Theories of 
Electricity and Magnetism" (1828) was neglected and almost completely unknown until it 
was reprinted in 1846. 
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this came to light only after his death, when a great quantity of material 
from his notebooks and scientific correspondence was carefully analyzed 
and included in his collected works. His scientific diary has already been 
mentioned. This little booklet of 19 pages, one of the most precious docu¬ 
ments in the history of mathematics, was unknown until 1898, when it was 
found among family papers in the possession of one of Gauss's grandsons. 
It extends from 1796 to 1814 and consists of 146 very concise statements of 
the results of his investigations, which often occupied him for weeks or 
months. 25 All of this material makes it abundantly clear that the ideas Gauss 
conceived and worked out in considerable detail, but kept to himself, would 
have made him the greatest mathematician of his time if he had published 
them and done nothing else. 

For example, the theory of functions of a complex variable was one of the 
major accomplishments of nineteenth century mathematics, and the central 
facts of this discipline are Cauchy's integral theorem (1827) and the Taylor 
and Laurent expansions of an analytic function (1831,1843). In a letter writ¬ 
ten to his friend Bessel in 1811, Gauss explicitly states Cauchy's theorem and 
then remarks, "This is a very beautiful theorem whose fairly simple proof I 
will give on a suitable occasion. It is connected with other beautiful truths 
which are concerned with series expansions." 26 Thus, many years in advance 
of those officially credited with these important discoveries, he knew 
Cauchy's theorem and probably knew both series expansions. However, for 
some reason the "suitable occasion" for publication did not arise. A possible 
explanation for this is suggested by his comments in a letter to Wolfgang 
Bolyai, a close friend from his university years with whom he maintained 
a lifelong correspondence: "It is not knowledge but the act of learning, not 
possession but the act of getting there, which grants the greatest enjoyment. 
When I have clarified and exhausted a subject, then I turn away from it in 
order to go into darkness again." His was the temperament of an explorer, 
who is reluctant to take the time to write an account of his last expedition 
when he could be starting another. As it was. Gauss wrote a great deal; but 
to publish every fundamental discovery he made in a form satisfactory to 
himself would have required several long lifetimes. 

Another prime example is non-Euclidean geometry, which has been com¬ 
pared with the Copernican revolution in astronomy for its impact on the 
minds of civilized men. From the time of Euclid to the boyhood of Gauss, 
the postulates of Euclidean geometry were universally regarded as neces¬ 
sities of thought. Yet there was a flaw in the Euclidean structure that had 
long been a focus of attention: the so-called parallel postulate, stating that 
through a point not on a line there exists a single line parallel to the given 
line. This postulate was thought not to be independent of the others, and 
many had tried without success to prove it as a theorem. We now know that 


25 See Gauss's Werke, vol. X, pp. 483-574,1917. 

26 Werke, vol. VIII, p. 91,1900. 
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Gauss joined in these efforts at the age of fifteen, and he also failed. But he 
failed with a difference, for he soon came to the shattering conclusion— 
which had escaped all his predecessors—that the Euclidean form of geom¬ 
etry is not the only one possible. He worked intermittently on these ideas 
for many years, and by 1820 he was in full possession of the main theorems 
of non-Euclidean geometry (the name is due to him). 27 But he did not reveal 
his conclusions, and in 1829 and 1832 Lobachevsky and Johann Bolyai (son 
of Wolfgang) published their own independent work on the subject. One 
reason for Gauss's silence in this case is quite simple. The intellectual cli¬ 
mate of the time in Germany was totally dominated by the philosophy of 
Kant, and one of the basic tenets of his system was the idea that Euclidean 
geometry is the only possible way of thinking about space. Gauss knew 
that this idea was totally false and that the Kantian system was a structure 
built on sand. However, he valued his privacy and quiet life, and held his 
peace in order to avoid wasting his time on disputes with the philosophers. 
In 1829 he wrote as follows to Bessel: "I shall probably not put my very 
extensive investigations on this subject [the foundations of geometry] into 
publishable form for a long time, perhaps not in my lifetime, for I dread the 
shrieks we would hear from the Boeotians if I were to express myself fully 
on this matter." 28 

The same thing happened again in the theory of elliptic functions, a very 
rich field of analysis that was launched primarily by Abel in 1827 and also 
by Jacobi in 1828-1829. Gauss had published nothing on this subject, and 
claimed nothing, so the mathematical world was filled with astonishment 
when it gradually became known that he had found many of the results of 
Abel and Jacobi before these men were born. Abel was spared this devas¬ 
tating knowledge by his early death in 1829, at the age of twenty-six, but 
Jacobi was compelled to swallow his disappointment and go on with his 
work. The facts became known partly through Jacobi himself. His attention 
was caught by a cryptic passage in the Disqirisitiones (Article 335), whose 
meaning can only be understood if one knows something about elliptic 
functions. He visited Gauss on several occasions to verify his suspicions 
and tell him about his own most recent discoveries, and each time Gauss 
pulled 30-year-old manuscripts out of his desk and showed Jacobi what 
Jacobi had just shown him. The depth of Jacobi's chagrin can readily be 
imagined. At this point in his life Gauss was indifferent to fame and was 
actually pleased to be relieved of the burden of preparing the treatise on 
the subject which he had long planned. After a week's visit with Gauss in 
1840, Jacobi wrote to his brother, "Mathematics would be in a very different 
position if practical astronomy had not diverted this colossal genius from 
his glorious career." 


27 Everything he is known to have written about the foundations of geometry was published 
in his Werke, vol. VIII, pp. 159-268,1900. 

28 Werke, vol. VIII, p. 200. The Boeotians were a dull-witted tribe of the ancient Greeks. 
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Such was Gauss, the supreme mathematician. He surpassed the levels of 
achievement possible for ordinary men of genius in so many ways that one 
sometimes has the eerie feeling that he belonged to a higher species. 


Appendix D. Chebyshev Polynomials 
and the Minimax Property 


In Problem 31-6 we defined the Chebyshev polynomials T n (x) in terms of 

f i \-x' 

the hypergeometric function by T n (x) = Fl n - n, —,- 


I, where n = 0,1,2, 


Needless to say, this definition by itself tells us practically nothing, for the 
question that matters is: what purpose do these polynomials serve? We will 
now try to answer this question. 

It is convenient to begin by adopting a different definition for the poly¬ 
nomials T n (x). We will see later that the two definitions agree. Our starting 
point is the fact that if n is a nonnegative integer, then de Moivre's formula 
from the theory of complex numbers gives 


cos zz9 + i sin zz9 = (cos 9 + i sin 9)' 1 

= cos" 9 + n cos" -1 9(z sin 9) 

+ n( - n ~ V > cos'- 2 9(z sin 9) 2 + • • • + (z sin 9)", (1) 


so cos zz9 is the real part of the sum on the right. Now the real terms in 
this sum are precisely those that contain even powers of z sin 9; and since 
sin 2 9 = 1- cos 2 9, it is apparent that cos zz9 is a polynomial function of cos 9. 
We use this as the definition of the zzth Chebyshev polynomial: T n (x) is that 
polynomial for which 


cos zz9 = T„(cos 9). (2) 

Since T„(x) is a polynomial, it is defined for all values of x. However, if x is 
restricted to lie in the interval -1 < x < 1 and we write x = cos 9 where 9 < 9 < zr, 
then (2) yields 

T„(x) = cos (n cos -1 x). (3) 

With the same restrictions, we can obtain another curious expression for 
T„(x). For on adding the two formulas 
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cos nQ ± i sin zz0 = (cos 0 ± i sin 0)", 

we get 

-1 

cos nQ = — |^(cos0 + zsin0)" + (cos0-zsin0)"^ 

= ^ [(cos 0 + z’Vl - cos 2 0)" + (cos 0 — / V1 - cos 2 0)" ] 
= ^ [(cos 0 + \l cos 2 0 -1)" + (cos 0 - V cos 2 0 -1)" ], 


so 


T„(x) = |[(x + V* 2 -l)" + (x-Vx 2 -l) n ] 


( 4 ) 


Another explicit expression for T„(x) can be found by using the binomial for¬ 
mula to write (1) as 


cos nQ + z’sin n 9 = | jcos" m 0(zsin0) m . 


We have remarked that the real terms in this sum correspond to the even 
values of m, that is, to m = 2k where k = 0,1, 2,.. [n/ 2]. 29 Since 

(z sin 0) m = (z sin 0) a = (-l) l (l - cos 2 O)* 1 = (cos 2 0 - l) k , 


we have 


WVf n \ 

C°S”0=Z 2k — n 2ka '— 2 ° ^ 

k=o v z V 


cos"- 2 * 0(cos 2 0 - l) k , 


and therefore 


In/ 2 ] 


n{x) = ^ 


n\ 


' (2k)\(n-2k)\ 


x n ~ 2k (x 2 -l) k . 


( 5 ) 


29 The symbol [n/2] is the standard notation for the greatest integer < n/2. 
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It is clear from (4) that T 0 (x) = 1 and Tfx)-x; but for higher values of n, T„(x) is 
most easily computed from a recursion formula. If we write 

cos nQ = cos [0 + (n - 1)0] = cos 0 cos (n -1)0 - sin 0 sin in - 1)0 


and 


cos (n- 2)0 = cos [-0 + (n -1) 0] 

= cos 0 cos (n -1) 0 + sin 0 sin (n -1) 0, 


then it follows that 


cos nQ + cos (n - 2)0 = 2 cos 0 cos (n -1)0. 

If we use (2) and replace cos 0 by x, then this trigonometric identity gives the 
desired recursion formula: 

T„(x) + T„_ 2 (x) = 2xT„_ 1 (x). (6) 

By starting with T 0 (x) = l and Tj(x) = x, we find from (6) that T 2 (x) = 2x 2 -1, 
T 3 (x) = 4x 3 - 3x, T 4 (x) = 8x 4 - 8x 2 +1, and so on. 

The hypergeometric form. To establish a connection between Chebyshev's 
differential equation and the Chebyshev polynomials as we have just 
defined them, we use the fact that the polynomial y = T n (x) becomes the 
function y = cos nQ when the variable is changed from x to 0 by means of 
x = cos 0. Now the function y = cos nQ is clearly a solution of the differential 
equation 


d 2 y 

W 


+ n 2 y = 0, 


( 7 ) 


and an easy calculation shows that changing the variable from 0 back to x 
transforms (7) into Chebyshev's equation 

( l_x 2 )^-x^ + « 2 y = 0. (8) 

ax dx 

We therefore know that y = T„(x) is a polynomial solution of (8). But 
Problem 31-6 tells us that the only polynomial solutions of (8) have the 
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form cF 



1 -x 

~T~ 


; and since (4) implies that T, I (1) = 1 for every n, and 


cF 




= c, we conclude that 


T„(x) = F 




( 9 ) 


Orthogonality. One of the most important properties of the functions 
y„(0) = cos H0 for different values of n is their orthogonality on the interval 
0 < 0 < ir, that is, the fact that 

n n 

J* y m y„dQ =j*cos m0 cos nd d0 = 0 if m*n. (10) 

o o 

To prove this, we write down the differential equations satisfied by y m = cos mQ 
and t/„ = cos n0: 


y" m + m 2 y m = 0 and y" + n 2 y„ = 0. 

On multiplying the first of these equations by y n and the second by y m , and 
subtracting, we obtain 

(y«y« - y'ny m ) + (m 2 - n 2 )y m y n = 0; 

£10 

and (10) follows at once by integrating each term of this equation from 0 to jt, 
since y' m and y' n both vanish at the endpoints and m 2 -n 2 j= 0. 

When the variable in (10) is changed from 0 to x = cos 0, (10) becomes 

[ T m (x)T„(x) dx _Q ifm^n. ( 11 ) 

j Vw 


This fact is usually expressed by saying that the Chebyshev polynomials 
are orthogonal on the interval -1 < x < 1 with respect to the weight function 
(1 - x^) -112 . When m = n in (11), we have 


1 


] ^MLdx=< 

Vl-X 2 


71 

2 


71 


for n ^ 0, 
for n =0. 


( 12 ) 
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These additional statements follow from 


n 

1 


cos 2 n0 dQ = 


for n * 0, 
for n = 0, 


which are easy to establish by direct integration. 

Just as in the case of the Hermite polynomials discussed in Appendix B, 
the orthogonality properties (11) and (12) can be used to expand an "arbi¬ 
trary" function/(x) in a Chebyshev series: 


/(x) = J\„r n (x). (13) 

n =0 


The same formal procedure as before yields the coefficients 



/(*) 

-y/l-X 2 


dx 


(14) 


and 


(15) 

^ J Vl-X 2 

for n > 0. And again the true mathematical issue is the problem of finding 
conditions under which the series (13)—with the a n defined by (14) and (15)— 
actually converges to/(x). 


The minimax property. The Chebyshev problem we now consider is to see 
how closely the function x” can be approximated on the interval 1 < x < 1 by 
polynomials a n _ l x n ~ l +■■■+ape+ a 0 of degree n- 1; that is, to see how small the 
number 


max x” - fl„_ 1 x” 1 - a x x - a 0 

-1<X<1 1 


can be made by an appropriate choice of the coefficients. This in turn is equiva¬ 
lent to the following problem: among all polynomials P(x) = x n + a n _ x x n ~^ + • ■ • + 
fljX + a 0 of degree n with leading coefficient 1, to minimize the number 

max |P(x)|/ 
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and if possible to find a polynomial that attains this minimum value. 

It is clear from Tfx)-x and the recursion formula (6) that when n > 0 the 
coefficient of x" in T n (x) is 2" _1 , so 2 1_ "T„(x) has leading coefficient 1. These 
polynomials completely solve Chebyshev's problem, in the sense that they 
have the following remarkable property. 


Minimax property. Among all polynomials P(x) of degree n> 0 zvith leading coef¬ 
ficient 1, 2 1_ "T n (x) deviates least from zero in the interval -1 < x < 1: 

max |P(x)| > max |2 1_ "T n (x)| = 2 1_ ". (16) 

1 1 _lcv<rl I 


Proof. First, the equality in (16) follows at once from 

max|T„(x)| = max|cosn9| = 1. 

-1<X<1 1 1 o<e<n 


To complete the argument, we assume that P(x) is a polynomial of the stated 
type for which 


max |P(x)| < 2 1 ", 

-1<x<1 


(17) 


and we deduce a contradiction from this hypothesis. We begin by noticing 
that the polynomial 2 1 ~"T„(x) - 2 1_ ” cos nd has the alternately positive and neg¬ 
ative values 2 1- ", 2 1_n , ..., ±2 1_ " at the n +1 points x that correspond to 

0 = 0, jr/n, 2-iz/n, ..., m/n = n. By assumption (17), Q(x) = 2 1 ~"T„(x)-P(x) has the 
same sign as 2 1_ "T„(x) at these points, and must therefore have at least n zeros 
in the interval -1 < x < 1. But this is impossible since Q(x) is a polynomial of 
degree at most n -1 which is not identically zero. 


In this very brief treatment the minimax property unfortunately seems 
to appear out of nowhere, with no motivation and no hint as to why the 
Chebyshev polynomials behave in this extraordinary way. We hope the 
reader will accept our assurance that in the broader context of Chebyshev's 
original ideas this surprising property is really quite natural. 30 For those who 
like their mathematics to have concrete applications, it should be added that 
the minimax property is closely related to the important place Chebyshev 
polynomials occupy in contemporary numerical analysis. 


30 Those readers who are blessed with indomitable skepticism, and rightly refuse to accept 
assurances of this kind without personal investigation, are invited to consult N. I. Achieser, 
Theory of Approximation, Ungar, New York, 1956; E. W. Cheney, Introduction to Approximation 
Theory, McGraw-Hill, New York, 1966; or G. G. Lorentz, Approximation of Functions, Holt, 
New York, 1966. 
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NOTE ON CHEBYSHEV. Pafnuty Lvovich Chebyshev (1821-1894) was the 
most eminent Russian mathematician of the nineteenth century. He was 
a contemporary of the famous geometer Lobachevsky (1793-1856), but his 
work had a much deeper influence throughout Western Europe and he is 
considered the founder of the great school of mathematics that has been 
flourishing in Russia for the past century. 

As a boy he was fascinated by mechanical toys, and apparently was first 
attracted to mathematics when he saw the importance of geometry for under¬ 
standing machines. After his student years in Moscow, he became professor 
of mathematics at the University of St. Petersburg, a position he held until 
his retirement. His father was a member of the Russian nobility, but after 
the famine of 1840 the family estates were so diminished that for the rest of 
his life Chebyshev was forced to live very frugally and he never married. He 
spent much of his small income on mechanical models and occasional jour¬ 
neys to Western Europe, where he particularly enjoyed seeing windmills, 
steam engines, and the like. 

Chebyshev was a remarkably versatile mathematician with a rare talent 
for solving difficult problems by using elementary methods. Most of his 
effort went into pure mathematics, but he also valued practical applica¬ 
tions of his subject, as the following remark suggests: "To isolate math¬ 
ematics from the practical demands of the sciences is to invite the sterility 
of a cow shut away from the bulls." He worked in many fields, but his 
most important achievements were in probability, the theory of numbers, 
and the approximation of functions (to which he was led by his interest in 
mechanisms). 

In probability, he introduced the concepts of mathematical expectation 
and variance for sums and arithmetic means of random variables, gave a 
beautifully simple proof of the law of large numbers based on what is now 
known as Chebyshev's inequality, and worked extensively on the central 
limit theorem. He is regarded as the intellectual father of a long series of 
well-known Russian scientists who contributed to the mathematical theory 
of probability, including A. A. Markov, S. N. Bernstein, A. N. Kolmogorov, 
A. Y. Khinchin, and others. 

In the late 1840s Chebyshev helped to prepare an edition of some of the 
works of Euler. It appears that this task caused him to turn his attention 
to the theory of numbers, particularly to the very difficult problem of the 
distribution of primes. As the reader probably knows, a prime number is 
an integer p > 1 that has no positive divisors except 1 and p. The first few 
are easily seen to be 2, 3, 5, 7,11,13, 17,19, 23, 29, 31, 37, 41, 43, .... It is clear 
that the primes are distributed among all the positive integers in a rather 
irregular way; for as we move out, they seem to occur less and less fre¬ 
quently, and yet there are many adjoining pairs separated by a single even 
number. The problem of discovering the law governing their occurrence— 
and of understanding the reasons for it—is one that has challenged the 
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curiosity of men for hundreds of years. In 1751 Euler expressed his own 
bafflement in these words: "Mathematicians have tried in vain to this day 
to discover some order in the sequence of prime numbers, and we have 
reason to believe that it is a mystery into which the human mind will 
never penetrate." 

Many attempts have been made to find simple formulas for the nth 
prime and for the exact number of primes among the first n positive inte¬ 
gers. All such efforts have failed, and real progress was achieved only 
when mathematicians started instead to look for information about the 
average distribution of the primes among the positive integers. It is cus¬ 
tomary to denote by it(x) the number of primes less than or equal to a 
positive number x. Thus; it(l) = 0, it(2) = 1, ji(3) = 2, jt(ji) = 2, jr(4) = 2, and so on. 
In his early youth Gauss studied it(x) empirically, with the aim of finding 
a simple function that seems to approximate it with a small relative error 
for large x. On the basis of his observations he conjectured (perhaps at the 
age of fourteen or fifteen) that x/log x is a good approximating function, 
in the sense that 


lim 7t(x) =1. (18) 

x/log X 


This statement is the famous prime number theorem ; and as far as anyone 
knows. Gauss was never able to support his guess with even a fragment of 
proof. 

Chebyshev, unaware of Gauss's conjecture, was the first mathematician 
to establish any firm conclusions about this question. In 1848 and 1850 he 
proved that 


0.9213...< n ^ <1.1055... 
x/log x 


(19) 


for all sufficiently large x, and also that if the limit in (18) exists, then its value 
must be l. 31 As a by-product of this work, he also proved Bertrand's postu¬ 
late: for every integer n > 1 there is a prime p such that n < p < 2n. Chebyshev's 
efforts did not bring him to a final proof of the prime number theorem (this 
came in 1896), but they did stimulate many other mathematicians to continue 
working on the problem. We shall return to this subject in Appendix E, in 
our note on Riemann. 


iii i 

The number on the left side of (19) is A = log 2 3 3 3 5 3 30 30 , and that on the right is 


6 

5 


A. 


31 
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Appendix E. Riemann's Equation 

Our purpose in this appendix is to understand the structure of Gauss's 
hypergeometric equation 

x(l - x)y" + [c - (a + b +1 )x]y' -aby = 0. (1) 


In Sections 31 and 32 we saw that this equation has exactly three regular 
singular points x = 0, x = 1, and x = °°, and also that at least one exponent has 
the value 0 at each of the points x = 0 and x=l. We shall prove that (1) is fully 
determined by these properties, in the sense that if we make these assump¬ 
tions about the general equation 

y" + P(x)y' + Q(x)y = 0, (2) 

then (2) necessarily has the form (1). 

We begin by recalling from Section 32 that if the independent variable in 
(2) is changed from x to t = 1/x, then (2) becomes 


y" + 


2 P(l/t) 


t 


t 1 




( 3 ) 


where the primes denote derivatives with respect to t. It is clear from (3) that 
the point x = °° is a regular singular point of (2) if it is not an ordinary point 
and the functions 


1 

t 



and 



are both analytic at t - 0. 

We now explicitly assume that (2) has x = 0, x = 1, and x - °° as regular singu¬ 
lar points and that all other points are ordinary It follows that xP(x) is ana¬ 
lytic at x = 0, that (x - l)P(x) is analytic at x = 1, and that x(x - 1)-P(x) is analytic 
for all finite values of x: 


x(x-l)P(x) = 

n =0 

If we substitute x = 1/t, then (4) becomes 



( 4 ) 
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Since x = °° is a regular singular point of (2), this function must be analytic at 
t = 0. We conclude that a 2 - a 3 = • • • = 0, so (4) yields 


p(x)= «o±M = A + ^ 
x(x-l) x x — 1 


( 5 ) 


for certain constants A and B. Similarly x 2 (x - l) 2 Q(x) is analytic for all finite 
values of x, so 


and 


x 2 (x-l) 2 Q(x) = y\x”, 

n =0 



( 6 ) 


As before, the assumption that x = °° is a regular singular point of (2) implies 
that (6) must be analytic at t - 0, so b 3 - b 4 - ■ ■ ■ = 0 and 


. b 0 + b\X + b 2 x 2 C D E F 

Q(X ) = \ = — + + - + - ¥ . 

x 2 (x-l)" X x~ x — 1 (x-1) 2 


( 7 ) 


Now the fact that (6) is bounded near f = 0 means that x 2 Q(x) bounded for 
large x, so 
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is also bounded and C + E = 0. This enables us to write (7) as 

CM D F C 

x 2 (x —1)“ x(x-l) / 

and in view of (5) and (8), equation (2) takes the form 


y + -+ 


X x — 1 


y + 


c 


D 


x' (x — l) 2 x(x-l) 


y= 0- 


( 8 ) 


( 9 ) 


Let the exponents belonging to the regular singular points 0, 1, and °° be 
denoted by oq and a 2 , (), and p 2 , Yi and y 2 , respectively These numbers are the 
roots of the indicial equations at these three points: 

m(m -1) + Am + D = 0, 

m(m -1) + Bm + F= 0, 

m(m - 1) + (2- A - B)m + (D + F-C)- 0. 

The first two of these equations can be written down directly by inspecting 
(9), but the third requires a little calculation based on (3). If we write these 
equations as 


m 2 +(A - 1 )m + D = 0, 
m 2 + (B- 1 )m + F = 0, 
m 2 + (l-A-B)m + (D + F-C) = 0, 

then by the well-known relations connecting the roots of a quadratic equa¬ 
tion with its coefficients, we obtain 

oq + a 2 = 1 - A, oq a 2 = D, 

p 1 + p 2 = l-B, PiP 2 = F, (10) 

Yi + y 2 = A + B-l, Y1Y2 = D + F-C. 

It is clear from the first column that 


«i + «2+Pi + P2 + Yi + Y2 = 1; 


( 11 ) 
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and by using (10), we can write (9) in the form 


l-cg -q 2 | 1 - Pi ~ P 2 


x x — 1 


x-1 



0.10.2 | P1P2 | Y 1 Y 2 - 0.10.2 — P1P1 
x 2 (x-1) 2 x(x — 1) 


( 12 ) 


This is called Riemann's equation, and (11) is known as Riemann's identity. 

The qualitative content of this remarkable conclusion can be expressed as 
follows: the precise form of (2) is completely determined by requiring that it 
have only three regular singular points x = 0, x = 1, and x = °° and by specify¬ 
ing the values of its exponents at each of these points. 

Let us now impose the additional condition that at least one exponent must 
have the value 0 at each of the points x = 0 and x = 1, say a, = (!, = 0. Then with a 
little simplification and the aid of (11), Riemann's equation reduces to 


x(l - x)y” + [(1 - a,) - ( Yl + y 2 +1 )Ay' ~ YiY 2 J/ = 0, 


which clearly becomes Gauss's equation (1) if we introduce the customary 
notation a = Yl , b - y 2 , c = l-oe 2 . For this reason, equation (12) is sometimes 
called the generalized hypergeometric equation. 

These results are merely the first few steps in a far-reaching theory of dif¬ 
ferential equations initiated by Riemann. One of the aims of this theory is 
to characterize in as simple a manner as possible all differential equations 
whose solutions are expressible in terms of Gauss's hypergeometric func¬ 
tion. Another is to achieve a systematic classification of all differential equa¬ 
tions with rational coefficients according to the number and nature of their 
singular points. One surprising fact that emerges from this classification is 
that virtually all such equations arising in mathematical physics can be gen¬ 
erated by confluence from a single equation with five regular singular points 
in which the difference between the exponents at each point is 1/2. 32 

NOTE ON RIEMANN. No great mind of the past has exerted a deeper influ¬ 
ence on the mathematics of the twentieth century than Bernhard Riemann 
(1826-1866), the son of a poor country minister in northern Germany. He 
studied the works of Euler and Legendre while he was still in secondary 
school, and it is said that he mastered Legendre's treatise on the theory of 


32 A full understanding of these further developments requires a grasp of the main principles 
of complex analysis. Nevertheless, a reader without this equipment can glean a few useful 
impressions from E. T. Whittaker and G. N. Watson, Modern Analysis, pp 203-208, Cambridge 
University Press, London, 1935; or E. D. Rainville, Intermediate Differential Equations, chap. 6, 
Macmillan, New York, 1964. 
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numbers is less than a week. But he was shy and modest, with little aware¬ 
ness of his own extraordinary abilities, so at the age of nineteen he went to 
the University of Gottingen with the aim of pleasing his father by studying 
theology and becoming a minister himself. Fortunately this worthy purpose 
soon stuck in his throat, and with his father's willing permission he switched 
to mathematics. 

The presence of the legendary Gauss automatically made Gottingen 
the center of the mathematical world. But Gauss was remote and 
unapproachable—particularly to beginning students—and after only a year 
Riemann left this unsatisfying environment and went to the University of 
Berlin. There he attracted the friendly interest of Dirichlet and Jacobi, and 
learned a great deal from both men. Two years later he returned to Gottingen, 
where he obtained his doctor's degree in 1851. During the next eight years 
he endured debilitating poverty and created his greatest works. In 1854 he 
was appointed Privatdozent (unpaid lecturer), which at that time was the 
necessary first step on the academic ladder. Gauss died in 1855, and Dirichlet 
was called to Gottingen at his successor. Dirichlet helped Riemann in every 
way he could, first with a small salary (about one-tenth of that paid to a full 
professor) and then with a promotion to an assistant professorship. In 1859 
he also died, and Riemann was appointed as a full professor to replace him. 
Riemann's years of poverty were over, but his health was broken. At the age 
of thirty-nine he died of tuberculosis in Italy, on the last of several trips he 
undertook in order to escape the cold, wet climate of northern Germany. 
Riemann had a short life and published comparatively little, but his works 
permanently altered the course of mathematics in analysis, geometry, and 
number theory. 33 

His first published paper was his celebrated dissertation of 1851 on the gen¬ 
eral theory of functions of a complex variable. 34 Riemann's fundamental aim 
here was to free the concept of an analytic function from any dependence 
on explicit expressions such as power series, and to concentrate instead on 
general principles and geometric ideas. He founded his theory on what are 
now called the Cauchy-Riemann equations, created the ingenious device of 
Riemann surfaces for clarifying the nature of multiple-valued functions, and 
was led to the Riemann mapping theorem. Gauss was rarely enthusiastic 
about the mathematical achievements of his contemporaries, but in his offi¬ 
cial report to the faculty he warmly praised Riemann's work: "The disserta¬ 
tion submitted by Herr Riemann offers convincing evidence of the author's 
thorough and penetrating investigations in those parts of the subject treated 
in the dissertation, of a creative, active, truly mathematical mind, and of a 
gloriously fertile originality." 


33 His Gesammelte Mathematische Werke (reprinted by Dover in 1953) occupy only a single vol¬ 
ume, of which two-thirds consists of posthumously published material. Of the nine papers 
Riemann published himself, only five deal with pure mathematics. 

34 Grundlagen fur eine allgemeine Theorie der Functionen einer veranderlichen complexen 
Grosse, in Werke, pp. 3-43. 
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Riemann later applied these ideas to the study of hypergeometric and 
Abelian functions. In his work on Abelian functions he relied on a remark¬ 
able combination of geometric reasoning and physical insight, the latter in 
the form of Dirichlet's principle from potential theory. He used Riemann sur¬ 
faces to build a bridge between analysis and geometry which made it possible 
to give geometric expression to the deepest analytic properties of functions. 
His powerful intuition often enabled him to discover such properties—for 
instance, his version of the Riemann-Roch theorem—by simply thinking 
about possible configurations of closed surfaces and performing imaginary 
physical experiments on these surfaces. Riemann's geometric methods in 
complex analysis constituted the true beginning of topology, a rich field of 
geometry concerned with those properties of figures that are unchanged by 
continuous deformations. 

In 1854 he was required to submit a probationary essay in order to be 
admitted to the position of Privatdozent, and his response was another preg¬ 
nant work whose influence is indelibly stamped on the mathematics of our 
own time. 35 The problem he set himself was to analyze Dirichlet's condi¬ 
tions (1829) for the representability of a function by its Fourier series. One 
of these conditions was that the function must be integrable. But what does 
this mean? Dirichlet had used Cauchy's definition of integrability, which 
applies only to functions that are continuous or have at most a finite number 
of points of discontinuity. Certain functions that arise in number theory sug¬ 
gested to Riemann that this definition should be broadened. He developed 
the concept of the Riemann integral as it now appears in most textbooks 
on calculus, established necessary and sufficient conditions for the existence 
of such an integral, and generalized Dirichlet's criteria for the validity of 
Fourier expansions. Cantor's famous theory of sets was directly inspired 
by a problem raised in this paper, and these ideas led in turn to the con¬ 
cept of the Lebesgue integral and even more general types of integration. 
Riemann's pioneering investigations were therefore the first steps in another 
new branch of mathematics, the theory of functions of a real variable. 

The Riemann rearrangement theorem in the theory of infinite series was an 
incidental result in the paper just described. He was familiar with Dirichlet's 
example showing that the sum of a conditionally convergent series can be 
changed by altering the order of its terms: 


,111111 

1 — +-+-+ — 

2 3 4 5 6 7 


8 


+ • ■ • = log 2, 


(13) 


1 + 


1 

3 


1 1 
—+ —+ 
2 5 


1 

7 




(14) 


35 Ueber die Darstellbarkeit einer Function durch eine trigonometrische Reihe, in Werke, 
pp. 227-264. 
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It is apparent that these two series have different sums but the same terms; 
for in (14) the first two positive terms in (13) are followed by the first negative 
term, then the next two positive terms are followed by the second negative 
term, and so on. Riemann proved that it is possible to rearrange the terms of 
any conditionally convergent series in such a manner that the new series will 
converge to an arbitrary preassigned sum, or diverge to °° or 

In addition to his probationary essay, Riemann was also required to pres¬ 
ent a trial lecture to the faculty before he could be appointed to his unpaid 
lectureship. It was the custom for the candidate to offer three titles, and the 
head of his department usually accepted the first. However, Riemann rashly 
listed as his third topic the foundations of geometry, a profound subject on 
which he was unprepared but which Gauss had been turning over in his 
mind for 60 years. Naturally, Gauss was curious to see how this particular 
candidate's "gloriously fertile originality" would cope with such a challenge, 
and to Riemann's dismay he designated this as the subject of the lecture. 
Riemann quickly tore himself away from his other interests at the time— 
"my investigations of the connection between electricity, magnetism, light, 
and gravitation"—and wrote his lecture in the next two months. The result 
was one of the great classical masterpieces of mathematics, and probably the 
most important scientific lecture ever given. 36 It is recorded that even Gauss 
was surprised and enthusiastic. 

Riemann's lecture presented in nontechnical language a vast generaliza¬ 
tion of all known geometries, both Euclidean and non-Euclidean. This field 
is now called Riemannian geometry; and apart from its great importance in 
pure mathematics, it turned out 60 years later to be exactly the right frame¬ 
work for Einstein's general theory of relativity. Like most of the great ideas 
of science, Riemannian geometry is quite easy to understand if we set aside 
the technical details and concentrate on its essential features. Let us recall 
the intrinsic differential geometry of curved surfaces which Gauss had dis¬ 
covered 25 years earlier. If a surface imbedded in three dimensional space is 
defined parametrically by three functions x-x(u,v), y=y(u,v), and z=z(n,v), 
then u and v can be interpreted as the coordinates of points on the sur¬ 
face. The distance ds along the surface between two nearby points (u,v) and 
(u + du,v + dv) is given by Gauss's quadratic differential form 

ds 2 = E du 2 + 2F du dv + G dv 2 , 

where E, F, and G are certain functions of u and v. This differential form 
makes it possible to calculate the lengths of curves on the surface, to find 
the geodesic (or shortest) curves, and to compute the Gaussian curvature 
of the surface at any point—all in total disregard of the surrounding space. 
Riemann generalized this by discarding the idea of a surrounding Euclidean 


36 Ueber die Hypothesen, Welche der Geometrie zu Grunde liegen, in Werke, pp. 272-286. There 
is a translation in D. E. Smith, A Source Book in Mathematics, McGraw-Hill, New York, 1929. 
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space and introducing the concept of a continuous ^-dimensional manifold 
of points (xj, x 2 , ..., x„). He then imposed an arbitrarily given distance (or 
metric) ds between nearby points 


(x v x 2 , ..., x„) and (xj + dx y x 2 + dx 2 , ..., x n + dx n ) 


by means of a quadratic differential form 


n 



(15) 


where the y, ; are suitable functions ofx j x 2 , ..x„ and different systems of y, ( 
define different Riemannian geometries on the manifold under discussion. 
His next steps were to examine the idea of curvature for these Riemannian 
manifolds and to investigate the special case of constant curvature. All of 
this depends on massive computational machinery, which Riemann mer¬ 
cifully omitted from his lecture but included in a posthumous paper on 
heat conduction. In that paper he explicitly introduced the Riemann cur¬ 
vature tensor, which reduces to the Gaussian curvature when n = 2 and 
whose vanishing he showed to be necessary and sufficient for the given 
quadratic metric to be equivalent to a Euclidean metric. From this point 
of view, the curvature tensor measures the deviation of the Riemannian 
geometry defined by formula (15) from Euclidean geometry. Einstein has 
summarized these ideas in a single statement: "Riemann's geometry of an 
n-d i mensiona I space bears the same relation to Euclidean geometry of an 
^-dimensional space as the general geometry of curved surfaces bears to the 
geometry of the plane." 

The physical significance of geodesics appears in its simplest form as the 
following consequence of Hamilton's principle in the calculus of variations: 
if a particle is constrained to move on a curved surface, and if no force acts 
on it, then it glides along a geodesic. 37 A direct extension of this idea is the 
heart of the general theory of relativity, which is essentially a theory of gravi¬ 
tation. Einstein conceived the geometry of space as a Riemannian geometry 
in which the curvature and geodesics are determined by the distribution of 
matter; in this curved space, planets move in their orbits around the sun by 
simply coasting along geodesics instead of being pulled into curved paths by 
a mysterious force of gravity whose nature no one has ever really understood. 

In 1859 Riemann published his only work on the theory of numbers, a brief 
but exceedingly profound paper of less than 10 pages devoted to the prime 
number theorem. 38 This mighty effort started tidal waves in several branches 


37 This is proved in Appendix B of Chapter 12. 

38 Ueber die Anzahl der Primzahlen unter einer gegebenen Grosse, in Werke, pp. 145-153. See 
the statement of the prime number theorem in our note on Chebyshev in Appendix D. 
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of pure mathematics, and its influence will probably still be felt a thousand 
years from now. His starting point was a remarkable identity discovered by 
Euler over a century earlier: if s is a real number greater than 1, then 




WY 


(16) 


where the expression on the right denotes the product of the numbers 
(1 - p~ s )~ l for all primes p. To understand how this identity arises, we note 
that l/(l-x) = l+x + x 2 + for |x|< 1, so for each p we have 


1 

MW) 


= 1 + 



l 



On multiplying these series for all primes p and recalling that each integer 
77 > 1 is uniquely expressible as a product of powers of different primes, we 
see that 


n 


i 

i-(i/p s ) 



= i+ 



i 

+•••+—+•• 
n 


z 


n =1 


l 


which is the identity (16). The sum of the series on the left of (16) is evidently 
a function of the real variable s > 1, and the identity establishes a connection 
between the behavior of this function and properties of the primes. Euler 
himself exploited this connection in several ways, but Riemann perceived 
that access to the deeper features of the distribution of primes can only be 
gained by allowing s to be a complex variable. He denoted the resulting 
function by ^(s), and it has since been known as the Riemann zeta function: 

1 1 

c(s) = 1 H-!- \ -, S = G + it . 

In his paper he proved several important properties of this function, and in 
a sovereign way simply stated a number of others without proof. During the 
century since his death, many of the finest mathematicians in the world have 
exerted their strongest efforts and created rich new branches of analysis in 
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attempts to prove these statements. The first success was achieved in 1893 by 
J. Hadamard, and with one exception every statement has since been settled 
in the sense Riemann expected. 39 This exception is the famous Riemann 
hypothesis: that all the zeros of £(s) in the strip 0 < o < 1 lie on the central line 
1 

a = —. It stands today as the most important unsolved problem of mathemat¬ 
ics, and is probably the most difficult problem that the mind of man has 
ever conceived. In a fragmentary note found among his posthumous papers, 
Riemann wrote that these theorems "follow from an expression for the func¬ 
tion i;(s) which I have not yet simplified enough to publish." 40 Writing about 
this fragment in 1944, Hadamard remarked with justified exasperation, "We 
still have not the slightest idea of what the expression could be." 41 He adds 
the further comment: "In general, Riemann's intuition is highly geometrical; 
but this is not the case for his memoir on prime numbers, the one in which 
that intuition is the most powerful and mysterious." 


39 Hadamard's work led him to his 1896 proof of the prime number theorem. See E. C. 
Titchmarsh, The Theory of the Riemann Zeta Function, chap. 3, Oxford University Press, 
London, 1951. This treatise has a bibliography of 326 items. 

40 Werke, p. 154. 

41 The Psychology of Invention in the Mathematical Field, p. 118, Dover, New York, 1954. 
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Fourier Series and Orthogonal Functions 


33 The Fourier Coefficients 

Trigonometric series of the form 



( 1 ) 


n= 1 


are needed in the treatment of many physical problems that lead to partial 
differential equations, for instance, in the theory of sound, heat conduction, 
electromagnetic waves, and mechanical vibrations. 1 We shall examine some 
of these applications in the next chapter The representation of functions by 
power series is familiar to us from calculus and also from our work in the 
preceding chapter. An important advantage of the series (1) is that it can rep¬ 
resent very general functions with many discontinuities—like the discontin¬ 
uous "impulse" functions of electrical engineering—whereas power series 
can represent only continuous functions that have derivatives of all orders. 

Aside from the great practical value of trigonometric series for solving 
problems in physics and engineering, the purely theoretical part of this sub¬ 
ject has had a profound influence on the general development of mathemati¬ 
cal analysis over the past 250 years. Specifically, it provided the main driving 
force behind the evolution of the modern notion of function, which in all its 
ramifications is certainly the central concept of mathematics; it led Riemann 
and Lebesgue to create their successively more powerful theories of integra¬ 
tion, and Cantor his theory of sets; it led Weierstrass to his critical study of 
the real number system and the properties of continuity and differentiability 
for functions; and it provided the context within which the geometric idea of 
orthogonality (perpendicularity) was able to develop into one of the major 
unifying concepts of modern analysis. We shall comment further on all of 
these matters throughout this chapter. 


1 It is only for reasons of convenience that the constant term in (1) is written — a 0 instead of a 0 . 
This will become clear below. 
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We begin our treatment with some classical calculations that were first 
performed by Euler. Our point of view is that the function/(x) in (1) is defined 
on the closed interval -it < x < it, and we must find the coefficients a n and b n in 
the series expansion. It is convenient to assume, temporarily, that the series is 
uniformly convergent, because this implies that the series can be integrated 
term by term from -it to it. 2 

Since 


1 


cos nx dx = 0 


—n 


and 


1 


sinnxdx = 0 


-71 


( 2 ) 


for n = 1 , 2 ,..., the term-by-term integration yields 


J* f(x) dx 


-n 


— a 0 n, 


so 


«0 — 



( 3 ) 


i 

It is worth noticing here that formula (3) shows that the constant term — a 0 in 

(1) is simply the average value of f(x) over the interval. The coefficient a n is 
found in a similar way. Thus, if we multiply (1) by cos nx the result is 


f(x) cos nx = —a 0 cos nx + 


+ a„ cos z nx + ■ 


( 4 ) 


where the terms not written contain products of the form sin mx cos nx or of 
the form cos mx cos nx with m±n. At this point it is necessary to recall the 
trigonometric identities 


sin mx cos nx = 

cos mx cos nx = 

sin mx sin nx = 


^[sin 

|[cos 

|[cos 


(m + n)x + sin (m - n)x], 
(m + ri)x + cos (m - n)x], 
(m - ri)x - cos (m + n)x]. 


2 Readers who are not acquainted with the concept of uniform convergence can freely integrate 
the series term by term anyway—as Euler and his contemporaries did without a qualm—as 
long as they realize that this operation is not always legitimate and ultimately needs theoreti¬ 
cal justification. 
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which follow directly from the addition and subtraction formulas for the sine 
and cosine. It is now easy to verify that for integral values of m and n> 1 we have 


Jsi 


sin mx cos nx dx = 0 


( 5 ) 


and 


n 

1 


cosmxcosnxdx = 0 m^n. 


These facts enable us to integrate (4) term by term and obtain 

K n 

\f(x) cos nx dx = a„ j* cos 2 nx dx = a n n, 


( 6 ) 


so 


71 

«n=- \f( X ) 

n J 


cos nxdx. 


( 7 ) 


By (3), formula (7) is also valid for n - 0; this is the reason for writing the 
1 

constant term in (1) as — a 0 rather than a 0 . We get the corresponding formula 

for b n by essentially the same procedure—we multiply (1) through by sin nx, 
integrate term by term, and use the additional fact that 

7U 

J* sin mx sin nx dx = 0, m n. (8) 

-71 


This yields 


n n 

jf(x)smnxdx - b„j sin 2 nxdx = b H n, 


SO 


b„ =— f/(x) sin nxdx. 

K J 


(9) 
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These calculations show that if the series (1) is uniformly convergent, then 
the coefficients a n and b„ can be obtained from the sum fix) by means of 
the above formulas. However, this situation is too restricted to be of much 
practical value, because how do we know whether a given/(x) admits an 
expansion as a uniformly convergent trigonometric series? We don't—and 
for this reason it is better to set aside the idea of finding the coefficients a„ 
and b n in an expansion (1) that may or may not exist, and instead use for¬ 
mulas (7) and (9) to define certain numbers a n and b n that are then used to 
construct the trigonometric series (1). When this is done, these a n and b n are 
called the Fourier coefficients of the function fix), and the series (1) is called 
the Fourier series of f(x). A Fourier series is thus a special kind of trigono¬ 
metric series—one whose coefficients are obtained by applying formulas 
(7) and (9) to some given function/(x). In order to form this series, it is not 
necessary to assume that/(x) is continuous, but only that the integrals (7) 
and (9) exist; and for this it suffices to assume that/(x) is integrable on the 
interval -it < x < jt. 3 

Of course, we hope that the Fourier series of/(x) will converge and have 
/(x) for its sum, and that therefore (1) will constitute a valid representation or 
expansion of this function. Unfortunately, however, this is not always true, 
for there exist many integrable—even continuous—functions whose Fourier 
series diverge at one or more points. Advanced treatises on Fourier series 
usually replace the equals sign in (1) by the symbol ~, in order to emphasize 
that the series on the right is the Fourier series of the function on the left but 
that the series is not necessarily convergent. We shall continue to use the 
equals sign because the series obtained in this book actually do converge for 
every value of x. 

Just as being a Fourier series does not imply convergence, convergence for 
a trigonometric series does not imply that it is a Fourier series. For example, 
it is known that 


oo 



( 10 ) 


converges for every value of x, and yet this series is known not to be a Fourier 
series. 4 This means that the coefficients in (10) cannot be obtained by apply¬ 
ing formulas (7) and (9) to any integrable function/(x), not even if we make 


3 In this context "integrable" means "Riemann integrable," which is defined in terms of upper 
sums and lower sums and is the standard concept used in most calculus courses. 

4 For convergence, see Problem 2(a) in Appendix C.12 of George F. Simmons, Calculus With 
Analytic Geometry, McGraw-Hill, New York, 1985. The fact that (10) is not a Fourier series is a 
consequence of the remarkable theorem that the term-by-term integral of any Fourier series 
(whether convergent or not) must converge for all x —and this is not true for (10). 




Fourier Series and Orthogonal Functions 


293 


the obvious choice and take/(x) to be the function that is the sum of the 
series. 

These surprising phenomena prevent the theory of Fourier series from 
being at all simple or straightforward, but they also render it extraordinarily 
fascinating to mathematicians. The fundamental problem of the subject is 
clearly to discover properties of an integrable function that guarantee that 
its Fourier series not only converges but also converges to the function. We 
shall state such properties in the next section, but first it is desirable to gain 
some direct, hands-on experience with the calculation of Fourier series for 
particular functions. 

Example 1. Find the Fourier series of the function/(x) = x, -ji < x < it. First, 
by (3) we have 



If n > 1, then we find a n by using (7) and integrating by parts with u = x, 
du = cos nx dx, 



— 71 


= 0 ; 


and using (9) with u = x, dv = sin nx dx gives 



— 71 


1 71COS TlK 7CCOS(—TlTl) 


fl 


n 



n n 


n 


since cos nn = (-1)". Now, substituting these results in (1) suggests that 



(it) 


2 


3 
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It should be clearly understood that the use of the equals sign here is an 
expression of hope rather than definite knowledge. 

In Appendix A we prove that the series (11) converges to x for -k < 
x < 71. To discuss the convergence behavior of the series outside this 
interval, we introduce the concept of periodicity. A function/(x) is said 
to be periodic if fix + p) = f{x) for all values of x, where p is a positive 
constant. 5 Any positive number p with this property is called a period 
of f(x); for instance, sinx in (11) has periods 2k, 4rt,..., and sin2x has 
periods ji, 2k,.... 

It is easy to see that each term of the series (11) has period 2k —in fact, 
2rc is the smallest period common to all the terms—so the sum also has 
period 2k. This means that the known graph of the sum between -k and 
k is simply repeated on each successive interval of length 2k to the right 
and left. The graph of the sum therefore has the sawtooth appearance 
shown in Figure 39. It is clear from this that the sum of the series is equal 
to x only on the interval -k < x < k, and not on the entire real line > 

< X < 00. 

It remains to describe what happens at the points x = ±k, ±3k,..., where 
the sum of the series as shown in the figure has a sudden jump from 
—k to + k. By putting x = ±ji, ±3k,... in (11), we see that every term of the 
series is zero. Therefore the sum is also zero, and we show this fact in the 
figure by putting a dot at these points. 

The first four terms of the series (11) are 

2 1 

2 sin x, -sin 2x, —sin 3x,-sin 4x. 

3 2 

These and the next two terms are sketched as the numbered curves in 
Figure 40. The sum of the four terms listed above is 



FIGURE 39 


5 It follows that we also have/ (x - p) =f(x), as can be seen by replacing x by x - p in the above 
equation. 
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FIGURE 40 


y = 2 sin x - sin 2x + —sin 3x - —sin 4x. (12) 

Since this is a partial sum of the Fourier series, and the series converges 
to x for -jt < x < n, we expect the partial sum (12) to approximate the func¬ 
tion y = x on this interval. The accuracy of the approximation is indicated 
by the upper curves in Figure 40, which show this partial sum of four 
terms and also the sums of six and ten terms. As the number of terms 
increases, the approximating curves approach y = x for each fixed x on 
the interval -jt < x < jt, but not for x = ± jt. 


Example 2. Find the Fourier series of the function defined by 


fix) = 0, -jt < x < 0; 
f(x) = it, 0 < x < K. 
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By (3), (7) and (9) we have 


1 

a 0 — — 

71 


U 71 

j*Odx + jndx 


= 7i; 


71 

a„ = —\ncosnxdx = 0 / n> 1; 

71 J 
0 

71J 77 

0 

=—[i-(-ir]. 

w L J 

Since the nth even number is In and the nth odd number is 2 n -1 the last 
of these formulas tells us that 


: 0, &2n-l — 


2n-l 


By substituting in (1) we obtain the required Fourier series. 


f(x) = — + 2 sinx + 


sin3x sin5x 


(13) 


The successive partial sums are 

71 71 71 2 

i/ = —, i/= — + 2sinx, 1 / = — + 2sinx + — sin3x,.... 

J 2 J 2 J 2 3 

The first four of these are sketched in Figure 41, together with the graph 

of y=f(x) 



FIGURE 41 
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FIGURE 42 


We will see in the next section that the series (13) converges to the 
function f(x ) on the subintervals -n < x < 0 and 0 < x < 7t, but not at 
the points 0, n -n. The sum of the series (13) is clearly periodic with 
period 2ic, and therefore the graph of this sum has the square wave 
appearance shown in Figure 42, with a jump from 0 to it at each point 
x = 0, ±jc, ±27t,.... Further, this sum evidently has the value jt/2 at each of 
these points of discontinuity, and we indicate this fact in the figure as 
we did before, by placing a dot at each of the points in question. And 
just as before, each dot is halfway between the limit of the function 
as we approach the point of discontinuity from the left and the limit 
from the right. 


Example 3. Find the Fourier series of the function defined by 
/(x) = -|, -ji < x < 0; 

= 0 < x < 71. 

This is the function in Example 2 minus the constant n/2. Its Fourier 
series can therefore be obtained by subtracting n/2 from the series (13), 
which gives 


/(x) = 2 sinx- 


sin3x sin5x 


- + - 


(14) 


The graph of the sum of this series is simply the square wave in Figure 42 
lowered to be symmetric about the x-axis, as shown in Figure 43. 


-4n 


4— 


-2ni 


!3tt 


3n! 


i 2 tt 


i4tt x 


FIGURE 43 
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Example 4. Find the Fourier series of the function defined by 


/(*) = -— -—x, -n<x<0; 

1 2 2 

f(x) = —-—x, 0<X<71. 

J 2 2 


This is the function defined in Example 3 minus one-half the function in 
Example 1. The Fourier series can therefore be obtained by subtracting 
one-half the series (11) term by term the series (14): 


f(x)= 2 sinx - 


sin3x sin5x 


- + - 


sin2x sin3x 

-I smx- 1 - 

2 3 


sin2x sin3x v -1 sin nx 
-- sm x +- 1 -+ ■ ■ ■ = 




(15) 


The graph of the sum of this series is the sawtooth wave shown in 
Figure 44. 


The validity of the procedures used in Examples 3 and 4 depends on the eas¬ 
ily verified fact that the operation of forming the Fourier coefficients is linear; 
that is, the coefficients for the sum/(i) + g(x) are the sums of the respective 
coefficients for/(x) and for g(x), and if c is any constant, then the coefficients 
for cf(x) are c times the coefficients for f(x). Also, the Fourier series of a con¬ 
stant function is simply the constant itself. 

Remark 1. In Section 36 we show how the interval -ji < x < it of length 2it can 
be replaced by an interval of arbitrary length, with no difficulty except for a 
slight loss of simplicity in the formulas. This extension of the ideas is neces¬ 
sary for many of the applications to science. 



FIGURE 44 
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Remark 2. Our work in this section —and throughout this chapter—rests on 
the property of orthogonality for the system of functions 

1 , cos nx, sin nx (n = 1 , 2,...) 


over the interval -n < x < n. This means that the integral of the product of 
any two of these functions over the interval is zero—which is precisely the 
substance of equations (2), (5), (6) and (8). We shall return to this concept in 
Sections 37 and 38 and use it to give a simple and satisfying geometric struc¬ 
ture to the theory of Fourier series. 


NOTE ON FOURIER. Jean Baptiste Joseph Fourier (1768-1830), an excel¬ 
lent mathematical physicist, was a friend of Napoleon (so far as such people 
have friends) and accompanied his master to Egypt in 1798. On his return 
he became prefect of the district of Isere in southeastern France, and in this 
capacity built the first real road from Grenoble to Turin. He also befriended 
the boy Champollion, who later deciphered the Rosetta Stone as the first 
long step toward understanding the hieroglyphic writing of the ancient 
Egyptians. 

During these years he worked on the theory of the conduction of heat, and 
in 1822 published his famous Theorie Analytique de la Chaleur, in which he 
made extensive use of the series that now bear his name. These series were 
of profound significance in connection with the evolution of the concept of a 
function. The general attitude at that time was to ca 11 fix) a function if it could 
be represented by a single expression like a polynomial, a finite combination 


of elementary functions, a power series 
of the form 



or a trigonometric series 


1 

-«o + 


00 

^(a,, cos nx + b n sinnx). 

n =1 


If the graph of f(x) were "arbitrary"—for example, a polygonal line with 
a number of corners and even a few gaps—then fix) would not have been 
accepted as a genuine function. Fourier claimed that "arbitrary" graphs can 
be represented by trigonometric series and should therefore be treated as 
legitimate functions, and it came as a shock to many that he turned out to 
be right. It was a long time before these issues were completely clarified, and 
it was no accident that the definition of a function that is now almost uni¬ 
versally used was first formulated by Dirichlet in 1837 in a research paper 
on the theory of Fourier series. Also, the classical definition of the definite 
integral due to Riemann was first given in his fundamental paper of 1854 on 
the subject of Fourier series. Indeed, many of the most important mathemati¬ 
cal discoveries of the nineteenth century are directly linked to the theory of 
Fourier series, and the applications of this subject to mathematical physics 
have been scarcely less profound. 
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Fourier himself is one of the fortunate few: his name has become rooted in 
all civilized languages as an adjective that is well known to physical scien¬ 
tists and mathematicians in every part of the world. 


Problems 

1. Find the Fourier series for the function defined by 

f(x) = n, -Jt<x<^; 

/(x) = 0, ^ < x < n. 


2. Find the Fourier series for the function defined by 


/(*) = 


0, -7t<x<0; 

1, 0 < x < —; 

2 


0 , 


< X < 71. 


3. Find the Fourier series for the function defined by 

/(x) = 0, -7i<x<0; 

/(x) = sin x, 0 < x < n. 

4. Solve Problem 3 with sin x replaced by cos x. 

5. Find the Fourier series for the function defined by 

(a) /(x) = ir ,-n<x <tz; 

(b) /(x) = sin x, -it < x < it; 

(c) /(x) = cos x, -% < x < k; 

(d) /(x) = ir + sin x + cos x, -n < k < k. 

Pay special attention to the reasoning used to establish your conclu¬ 
sions, including the possibility of alternate lines of thought. 

Solve Problems 6 and 7 by using the methods of Examples 3 and 4, 
without actually calculating the Fourier coefficients. 
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6. Find the Fourier series for the function defined by 

(a) f(x) = -a, -n < x < 0 and/(x) = a, 0 < x < it (a is a positive number); 

(b) f(x) - -1, -it < x < 0 and/(x) = 1, 0 < x < it; 

(c) f(x) = -^,-n<x < 0and/(x) = ^,0 <x<n; 

(d) f(x) = -1,-ji < x < 0 and/(x) = 2, 0 < x < ir; 

(e) f(x) = 1, -it < x < 0 and fix) = 2, 0 < x < it. 

7. Obtain the Fourier series for the function in Problem 2 from the result 
of Problem 1. Hint: Begin by forming it - (the function in Example 2). 

8. Without using Fourier series at all, show graphically that the sawtooth 
wave of Figure. 33 can be represented as the sum of a sawtooth wave of 
period it and a square wave of period 2it. 


34 The Problem of Convergence 

The examples and problems in Section 33 illustrate several features that are 
characteristic of Fourier series in general and which we now discuss from 
a general point of view. Our purpose is to attain a good understanding of a 
useful set of conditions that will guarantee that the Fourier series of a func¬ 
tion not only converges, but also converges to the function. 

We begin by pointing out that each term of the series 



( 1 ) 


has period 2it, and therefore, if the function/(x) is to be represented by the 
sum,/(x) must also have period 2it. Whenever we consider a series like (1), we 
shall assume that/(x) is initially given on the basic interval -it < x < ir or -it < 
x < it, and that for other values of x,/(x) is defined by the periodicity condition 


f(x + 2it) - fix). 


( 2 ) 


In particular, (2) requires that we must always have/(it) =/(-it). Accordingly, 
the complete function we consider is the so-called "periodic extension" of 
the originally given part to the successive intervals of length 2it that lie to the 
right and left of the basic interval. 

The phrase simple discontinuity (or often jump discontinuity) is used to 
describe the situation where a function has a finite jump at a point x = x 0 . 
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FIGURE 45 


This means that/(x) approaches finite but different limits from the left side 
of x 0 and from the right side, as shown in Figure 45. We can express this 
behavior by writing 


lim/(x 0 - e) lim/(x 0 + e), e>0, 

G— >0 G— >0 


where it is understood that both limits exist and are finite. It will be conve¬ 
nient to denote these limits by the simpler symbols f(x 0 -) and/(x 0 +), so that 
the above inequality can be written as 

/ ( x o ~) // ( x o +)• 

A function/(x) is said to be bounded if an inequality of the form 

I/Ml 

holds for some constant M and all x under consideration. For example, the 
functions x 2 , e x and sin x are bounded on -k < x < ir, but/(x) = 1 /(ji - x) is not. 
It can be proved (see Problem 7 below) that if a bounded function/(x) has 
only a finite number of discontinuities and only a finite number of maxima 
and minima, then all its discontinuities are simple. This means that/(x -) 
and/(x +) exist at every point x, and points of continuity are those for which 
/(x-)=/(x+). 

Each of the functions shown in Figures 39, 42, 43, and 44 satisfies these 
conditions on every finite interval. However, the function defined by 
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/(x) = sin — (x * 0), /(0) = 0 

x 

has infinitely many maxima near x - 0, and the discontinuity at x = 0 is not 
simple [Figure. 46 (a)]. The functions defined by 

g(x) = x sin — (x * 0), g(0) = 0 

and 

, 1 

h(x) = x 2 sin— (x^O), h( 0) = 0 
x 

also have infinitely many maxima near x = 0 [Figures 46 (b) and 46 (c)], 
but both are continuous at x = 0 whereas only h(x) is differentiable at this 
point. 

We are now in a position to state the following theorem, which establishes 
the desired convergence behavior for a very large class of functions. 




FIGURE 46 
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Dirichlet's Theorem. Assume that f(x) is defined and bounded for -n < x < it, 
and also that it has only a finite number of discontinuities and only a finite num¬ 
ber of maxima and minima on this interval. Let f(x) be defined for other values 
of x by the periodicity condition f (x + 2ji) - f(x). Then the Fourier series off(x) 
converges to 


|[/>-)+/(*+)] 

at every point x, and therefore it converges tof(x) at every point of continuity of the 
function. Thus, if at every point of discontinuity the value of the function is redefined 
as the average of its two one-sided limits there, 

/(x) =*[/(*-)+/(*+)], 

then the Fourier series represents the function everywhere. 6 

The conditions imposed on f(x) in this theorem are called Dirichlet conditions, 
after the German mathematician P. G. L. Dirichlet who discovered the theo¬ 
rem in 1829. In Appendix A we establish the same conclusion under slightly 
different hypotheses—piecewise smoothness—which are still sufficiently 
weak to cover almost all applications. 7 

The general situation is as follows: The continuity of a function is not suf¬ 
ficient for the convergence of its Fourier series to the function, and neither is 
it necessary. 8 That is, it is quite possible for a discontinuous function to be 
represented everywhere by its Fourier series, provided its discontinuities are 
relatively mild, and provided it is relatively well-behaved between the points 
of discontinuity. In Dirichlet's theorem above, the discontinuities are simple 
and the graph consists of a finite number of increasing or decreasing con¬ 
tinuous pieces; and in the theorem we prove in Appendix A, the discontinui¬ 
ties are again simple and the graph consists of a finite number of continuous 
pieces with continuously turning tangents. 


6 We remind the reader that the value of an integrable function can be redefined at any finite 
number of points without changing the value of its integral, and therefore without changing 
the Fourier series of the function. 

7 Proofs of Dirichlet's theorem in a slightly more general form can be found in E. C. Titchmarsh, 
The Theory of Functions, 2d ed., Oxford University Press, 1950, pp. 406-407; in W. Rogosinski, 
Fourier Series, Chelsea, New York, 1950, pp. 72-74; and in Bela Sz.-Nagy, Introduction to Real 
Functions and Orthogonal Expansions, Oxford University Press, 1965, pp. 399-402. 

8 It is a major unsolved problem of mathematics to find conditions that are both necessary and 
sufficient. 



Fourier Series and Orthogonal Functions 


305 


Example. Find the Fourier series of the periodic function defined by 


First, we have 


fix) = 0, —7i < x < 0; 

fix) = x, 0 < x < 71. 


a 0 



dx = 


1 

71 2 


n 

2' 


For n > 1, we integrate by parts to obtain 


If t 1 xsin nx cos nx 

a n = — \x cos nx ax = — -h-■=— 

71 J 71 n n 

o 


= ~2 (COS ?Z7t — 1) = —y t( — 1)" — 1]/ 

nn nn 


so 


fl 2 „ = 0 and 


2 

a 2n-l ~ - - -—• 

71(277 - 1 ) 


Similarly, 


71 

f • A 1 

xcosnx 

sin nx 

n 

lx sin nx dx= — 

-+ 



J 71 

n 

n 


0 




l 

n cos nn 

(-1)" +1 

n 

n 

n 


The Fourier series is therefore 


_n 2 cos(2?7 -1)x 1 .„ +1 sinnx 

4 71 (2n-l) 2 jj 


(3) 


By Dirichlet's theorem this equation is valid at all points of continuity, 
since fix) is understood to be the periodic extension of the initially given 
part (see Figure 47). At the point of discontinuity x = it, the series con¬ 
verges to 
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FIGURE 47 


When x = it is substituted in (3), this yields the following interesting sum 
of the reciprocals of the squares of the odd numbers. 


V 1 ff_ J_ ff_ = Tf_ 

2-j(2n-lf~ + 3 2 + 5 2 + 7 2 + '"“ 8 ' 


(4) 


The same sum is obtained by substituting the point of continuity x = 0 
into (3). Further, we can use (4) to find the sum of the reciprocals of the 
squares of all the positive integers, 


00 

I 



All that is needed to establish this is to write 


Hn 2 2(2 n) 2 + 


2 


i 

(2n - l) z 


3y 1 _n 2 
4 


and 



4 ^n 2 8 

4 TC 2 _ 7T 2 
3'Y”T' 


(5) 


The sum (5) was found by Euler in 1736, and is one of the most memo¬ 
rable discoveries in the early history of infinite series. 9 


NOTE ON DIRICHLET. Peter Gustav Lejeune Dirichlet (1805-1859) was 
a German mathematician who made many contributions of lasting value 
to analysis and number theory. As a young man he was drawn to Paris by 


9 For Euler's own wonderfully ingenious way of discovering (5), see Appendix A 12 in the 
Simmons book cited in footnote 4. 
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the reputations of Cauchy, Fourier, and Legendre, but he was most deeply 
influenced by his encounter and lifelong contact with Gauss's Discjiusitiones 
Arithmeticae (1801). This prodigious but cryptic work contained many of the 
great master's far-reaching discoveries in number theory, but it was under¬ 
stood by very few mathematicians at that time. As Kummer later said, 
"Dirichlet was not satisfied to study Gauss's Disquisitiones once or several 
times, but continued throughout his life to keep in close touch with the 
wealth of deep mathematical thoughts which it contains by perusing it again 
and again. For this reason the book was never put on the shelf but had an 
abiding place on the table at which he worked. Dirichlet was the first one 
who not only fully understood this work, but also made it accessible to oth¬ 
ers." In later life Dirichlet became a friend and disciple of Gauss, and also 
a friend and advisor of Riemann, whom he helped in a small way with his 
doctoral dissertation. In 1855, after lecturing at Berlin for many years, he suc¬ 
ceeded Gauss in the professorship at Gottingen. 

One of Dirichlet's earliest achievements was a milestone in analysis: In 
1829 he gave the first satisfactory proof that certain specific types of func¬ 
tions are actually the sums of their Fourier series. Previous work in this field 
had consisted wholly of the uncritical manipulation of formulas; Dirichlet 
transformed the subject into genuine mathematics in the modern sense. As a 
byproduct of this research, he also contributed greatly to the correct under¬ 
standing of the nature of a function, and gave the definition which is now 
most often used, namely, that y is a function of x when to each value of x in 
a given interval there corresponds a unique value of y. He added that it does 
not matter whether y depends on x according to some "formula" or "law" or 
"mathematical operation," and he emphasized this by giving the example of 
the function of x which has the value 1 for all rational x's and the value 0 for 
all irrational x's. 

Perhaps his greatest works were two long memoirs of 1837 and 1839 in 
which he made very remarkable applications of analysis to the theory of 
numbers. It was in the first of these that he proved his wonderful theorem 
that there are an infinite number of primes in any arithmetic progression of 
the form a + nb, where a and b are positive integers with no common factor. 
His discoveries about absolutely convergent series also appeared in 1837. His 
convergence test, referred to in footnote 4 in Section 33, was published post¬ 
humously in his Vorlesungen liber Zahlentheorie (1863). These lectures went 
through many editions and had a very wide influence. 

He was also interested in mathematical physics, and formulated the 
so-called Dirichlet principle of potential theory, which asserts the exis¬ 
tence of harmonic functions (functions that satisfy Laplace's equation) 
with prescribed boundary values. Riemann—who gave the principle its 
name—used it with great effect in some of his profoundest researches. 
Hilbert gave a rigorous proof of Dirichlet's principle in the early twentieth 
century. 
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Problems 


1. In Problems 1,2,3,4,6 of Section 33, sketch the graph of the sum of each 
Fourier series on the interval -5k <x<5k. 

2. Use the example in the text to write down without calculation the 
Fourier series for the function defined by 


f(x) = -x, -k<x<0; 

f(x) = 0 , 0 < x < n. 


Sketch the graph of the sum of this series on the interval -5k <x<5k. 

3. Find the Fourier series for the periodic function defined by 


f(x) = -K, -7i<x<0; 

f(x) = X, 0 < X < 71. 


Sketch the graph of the sum of this series on the interval - 5 ti < x < 5 ti and 
find what numerical sums are implied by the convergence behavior at 
the points of discontinuity x = 0 and x = k. 

4. (a) Show that the Fourier series for the periodic function defined by 
/(x) = 0, -k < x < 0 and/(x) = x 2 , 0 < x < k is 



00 


00 . / . 



(b) Sketch the graph of the sum of this series on the interval - 5k < x < 5 k. 

(c) Use the series in (a) with x = 0 and jr to obtain the sums 


_J_ = nf 

2 2 + 3 2 4 2 + '" 12 



and 
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(d) Derive the second sum in 
both sides. 


(c) from the first. Hint: Add j to 


5. (a) Find the Fourier series for the periodic function defined by f(x) = 
e x , -k<x<k. Hint: Recall that sinh x = (e x - e x )/2. 

(b) Sketch the graph of the sum of this series on the interval 
-5k < x < 5k. 

(c) Use the series in (a) to establish the sums 



and 



6. Mathematicians prefer the classes of functions they study to be lin¬ 
ear spaces, that is, to be closed under the operations of addition and 
multiplication by scalars. Unfortunately this is not true for the class of 
functions defined on the interval -k < x < k that satisfy the Dirichlet 
conditions. Verify this statement by examining the functions 



and 


g(x) = ~2x. 


7. If f(x) is defined on the interval -k < x < k and satisfies the Dirichlet 
conditions there, prove that/(x-) and/(x+) exist at every interior point, 
and also that / (x+) exists at the left endpoint and / (x-) exists at the 
right endpoint. Hint: Each interior point of discontinuity is isolated 
from other such points, in the sense that the function is continuous at 
all nearby points; also, on each side of such a point and near enough 
to it, the function does not oscillate, and is therefore increasing or 
decreasing. 
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35 Even and Odd Functions. Cosine and Sine Series 

In principle, our work in the preceding sections could have been based on 
any interval of length 2n, for instance, on the interval 0 < x < 2ir. However, the 
symmetrically placed interval -n < x < n has substantial advantages for the 
exploitation of symmetry properties of functions, as we now show. 

A function/(x) defined on this interval (or on any symmetrically placed 
interval) is said to be even if 


f(-x) =/(x). 


( 1 ) 


and/(x) is said to be odd if 


/(-x) = -/(x). 


( 2 ) 


For example, x 2 and cos x are even, and x 3 and sin x are odd. The graph of 
an even function is symmetric about the y-axis, as shown in Figure 48, and 
the graph of an odd function is skew-symmetric (Figure 49). By putting x = 0 
in (2), we see that an odd function always has the property that/(0) = 0. It is 
clear from the figures that 



a 


a 


(3) 


-a 


0 


and 



(4) 


-a 


y 



X 


FIGURE 48 
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y 



X 


FIGURE 49 


because the integrals represent the algebraic (signed) areas under the curves. 
These facts can also be established by analytic reasoning based on the defini¬ 
tions (1) and (2) [see Problem 3 below]. Products of even and odd functions 
have the simple properties 

(even)(even) = even, (even)(odd) = odd, (odd)(odd) = even, 
which correspond to the familiar rules 


(+!)(+!) = + 1 / (+!)(-!) = -I (- 1 ) (-!) = +!. 


For instance, to prove the second property we consider the function F(x)-f(x) 
g(x), where/(x) is even and g(x) is odd. Then 


F(~x) =/(-*) gM =f(x) [-£(*)] = -/(x) g(x) = -F(x), 


which shows that the product/(x) g(x) is odd. The other two properties can 
be proved similarly. As an example, we know that x 3 cos nx is odd because x 3 
is odd and cos nx is even, so (4) tells us at once that 


K 



without the need for detailed integrations by parts. 

The following simple theorem clarifies the significance of these ideas for 
the study of Fourier series. 
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Theorem. Let f(x) be an integmble function defined on the interval -k < x < k. 
Iff(x) is even , then its Fourier series has only cosine terms and the coefficients are 
given by 


K 

«»=-[/(*) 

71 J 


cos nxdx, b n =0. 


(5) 


And iff(x) is odd, then its Fourier series has only sine terms and the coefficients are 
given by 


a n =0, b n = —[ f(x)sinnx dx. 

K J 

0 


( 6 ) 


To prove this, we assume first that f(x) is even. Then fix) cos nx is even (even 
times even) and by (3) we have 


7U 71 

a n = — I f(x) cos nx dx = — I f(x) cos nx dx. 

n J n J 


On the other hand ,f(x) sin nx is odd (even times odd), so (4) tells us that 

71 

bn = |/(x)sii 

n J 


) sin nxdx = 0, 


which completes the argument for (5). It is easy to establish (6) by similar 
reasoning. 


Example 1. (a) First, we briefly consider the function f(x) = x on the 
interval -jt < x < n. Since this is an odd function, its Fourier series is 
automatically a sine series, and therefore it is not necessary to bother 
calculating the cosine coefficients. We found in Section 33 that the 
Fourier series is 


. sin2x sin3x 
x = 2 smx- 1 - 


(7) 


and we know that this expansion is valid only on the open inter¬ 
val -7t < x < it and not at the endpoints x = ±n, because any series of sines 
converges to zero at these points. 
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FIGURE 50 


(b) Next, we consider the function/(x) = |x| on the interval -jt < x < 7t 
(Figure 50). Since this is an even function, its Fourier series reduces to a 
cosine series, and by (5) we have 


n 

a n = — | x | cos nx dx 

71 J 


2 

71 


Jxcos nx dx. 
0 


It is easy to see that a 0 = 7t, and for n > 1 an integration by parts gives 
a„ =^(C 0 S 77 7I —l) = ^r(-l)" -1~|. 

7 XU 7177" L J 


This tells us that 


a n = 0 and 


a 2n-\ — ~ 


4 

7l(2n -l) 2 ’ 


so we have the expansion 


71 

2 


4( cos3x cos5x 

71 1 3 2 5 2 


( 8 ) 


The periodic extension of the initially given function is shown in 
Figure 51. We see at once from the ideas of Section 34 that the series in 
(8) converges to this extension for all x, and therefore the expansion (8) is 
valid on the closed interval - 7 c < x < 7 c. 
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y 



-3it —2tt -it 


n 


2n 


3n x 


FIGURE 51 


Since | x \ = x for x ^ 0, the two series (7) and (8) are both expansions of 
the same function/(x) =ion the interval 0 < x < n. The first series (7) is 
called the Fourier sine series for x, and (8) is called the Fourier cosine series 
for x. Similarly, any function/(x) defined on the interval 0 < x < k that sat¬ 
isfies the Dirichlet conditions there can be expanded in both a sine series 
and a cosine series on this interval—with the proviso that the sine series 
cannot converge to/(x) at the endpoints x = 0 and x = jr unless/(x) has the 
value 0 at these points. 

To obtain the sine series for/(x), we redefine the function (if necessary) 
to have the value 0 at x = 0, and then we extend it over the interval -it < 
x < 0 in such a way that the extended function is odd. That is, we define 
f(x) for -it < x < 0 by putting/(x) = -f(-x). The extended function is clearly 
odd, so its Fourier series contains sine terms only, and its coefficients 
are given by (6). Similarly, we obtain the cosine series for/(x) by extend¬ 
ing f(x) to be an even function on the interval -jt < x < jt and using (5) 
to calculate the coefficients. With respect to the sine and cosine series 
described here, we emphasize particularly that the original function f(x) 
is not assumed in advance to be odd, or even, or periodic, or defined 
elsewhere at all; it is intended to be an essentially arbitrary function on 
the interval 0 < x < n —within the very weak restrictions imposed by the 
Dirichlet conditions. 


Example 2. Find the sine series, and also the cosine series, for the func¬ 
tion/^) = cos x, 0 < x < 71. 

For the sine series, (6) gives 


a„ = 0 and b n = 



7t 


0 


For h = 1 we have b j = 0, and for n > 1 a short calculation yields 


n 


2n l + (-l) n 
n 2 -1 


b, 


n 
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We therefore have 


b 2 n-i = 0 and 


b 2n = 


8 n 

7l(4« 2 - 1)' 


so the sine series is 


cosx = 


8 ^ n sin 2 nx 
n ^ An 2 -1 


0 < x < n. 


To obtain the cosine series, we observe that (5) gives b n = 0 and 


a»=- f 

71 J 


— cos x sin nx dx = 


for n = 1 
for n ^ 1. 


Therefore the cosine series for cos x is simply cos x, just as we would have 
expected. This conclusion also follows directly from the equation cos x = 
cos x, because our work in Section 33 shows that any finite trigonometric 
series (the right side) is automatically the Fourier series of its sum (the 
left side). 


Problems 

1. Determine whether each of the following functions is even, odd, or 
neither: 


x 5 sin x, x 2 sin lx, e x , (sin x) 3 , sin x 2 , cos (x + x 3 ), x + x 2 +x 3 , log 


1 + x 

1 — X 


2. show that any function/(x) defined on a symmetrically placed interval 
can be written as the sum of an even function and an odd function. 
Hint : 


fix) = \m +k-x )]+ \m-f(-x)i 

3. Prove properties (3) and (4) analytically, by making x--t in the part of 
the Integral from -a to 0 and using the definitions (1) and (2). 
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4. Show that the sine series of the constant function/(x) = it/4 is 


n 


. OJLA L Oil L JC 

= smx +-+-+ •••, 

3 5 


sin3x sin5x 


0 < x < 71. 


4 


What sum is obta.ned by putting x = jt/2? What is the cosine series of 
this function? 

5. Find the Fourier series for the function of period 2 ti defined by /(x) = 



val - 5 jt < x < 5 ji. 

6. Find the sine and cosine series for sin x. 

7. Find the Fourier series for the function of period 2 ji defined by 



(a) by computing the Fourier coefficients; 

(b) directly from the expansion (8). 

Sketch the graph of the sum of this series (a triangular wave) on the 
interval -5 ji < x < 5jt. 

8. For the function/(x) = ji - x, find 

(a) its Fourier series on the interval -k < x < jt; 

(b) its cosine series on the interval 0 < x < jr; 

(c) its sine series on the interval 0 < x < ji. 

Sketch the graph of the sum of each of these series on the interval -5 ji < 
x < 5 jt. 

9. If/(x) = x for 0 < x < ji/ 2 and/(x) = jr - x for ji/ 2 < x < jt, show that the 
cosine series for this function is 



Sketch the graph of the sum of this series on the interval -5 ji < x < 5 jt. 

10. (a) Show that the cosine series for x 2 is 
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(b) Find the sine series for x 2 , and use this expansion together with 
formula (7) to obtain the sum 

, 1 J__J_ 7^ 

3 3 + 5 3 7 3 +--- 32 - 

(c) Denote by s the sum of the reciprocals of the cubes of the odd 
numbers. 


1 + 




• = s. 


and show that then 


°° t 


,111 8 

— 1H-H- 7T H-s - + • • • — — S. 

2 3 3 3 4 3 7 


The exact numerical value of the latter sum has been one of unsolved 
mysteries of mathematics since Euler first raised the question in 
1736. 

11. (a) Show that the cosine series for x 3 is 


3 7i , v-q , cos nx 
4 Z-F ’ n 2 K 


n cos nx i 24 ^ cos(2?7 - l)x 
(277 -l) 4 ' 


0 < X < 7t. 


(b) Use the series in (a) to obtain, in this order, the sums 

1 _ 7T 4 , 1 _ 7I 4 

4 J (2n-l) 4 ~ % an 90 


12. (a) Show that the cosine series for x 4 is 

4 °o 2 2 r 

4 71 oV/ 71 n ”0 

X =-+ 8 > (-1) -i- COS77X, 

5 ZJ „ 4 

-71 < X < 71. 

(b) Use the series in (a) to obtain again the second sum in Problem 
11(b). 
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13. (a) If a is not an integer, show that 

sin an 2a sin an 


cos ax =-+ - 

an 




cos nx 


2 2 

a -n 


for -n < x < n. 

(b) Use the series in (a) to obtain the formula 


i 00 i 

1 v - ' 1 

ncotan = — + 2a> —= -,. 

a Va‘-n 


This is called Euler's partial fractions expansion of the cotangent. 
(c) Rewrite the expansion in (b) in the form 


n V" 1 -2 1 
ncotnf-= 


nf 


Zu n 2 ~t 2 ' 


and by integrating term by term from f = 0tof = x(0<x<l) obtain 

/ • \ °o / 

f sirinx ) — ' 




or 


sinnx 

nx 


/ 

2 \ 

f 

2 \ 

f 

2 'N 


X 


X 


X 

1 - 


1 - 


1 - 



l 2 


2 2 


3 2 

V 

^ J 

V 

^ J 

V 



If x is replaced by x/n, this infinite product takes the equivalent 
form 


sinx 

( 

1- 

V 

2 \ 
X 

/ 

1 

x 2 ^ 

/ 

1 

x 2 ^ 

X 

" 2 J 

V 

4n 2 J 

V 

9n 2 J 


which is called Eider's infinite product for the sine. Observe that this 
formula displays the nonzero roots x = ±n, ±2n, ±3n,... of the tran¬ 
scendental equation sin x = 0. 

14. The functions sin 2 x and cos 2 x are both even. Show briefly, without cal¬ 
culation, that the identities 

• 2 1 ,, „ , 1 1 

sin x = —(l-cos2x) =-cos2x 

2 2 2 
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and 


1 11 

cos 2 x = — (1 + cos 2x) = — + — cos 2x 
2 2 2 

are the Fourier series expansions of these functions. 

15. Find the sine series of the functions in Problem 14, and verify that these 
expansions satisfy the identity sin 2 x + cos 2 x = 1. 

16. Prove the trigonometric identities 

3 1 3 1 

sin 3 x = — sin x-sin 3x and cos 3 x = —x + — cos 3x, 

4 4 4 4 

and show briefly, without calculation that these are the Fourier series 
expansions of the functions on the left. 


36 Extension to Arbitrary Intervals 

The standard form of a Fourier series is the one we have worked with in the 
preceding sections, where the function under consideration is defined on the 
interval -ji < x < jt. In many applications it is desirable to adapt the form of a 
Fourier series to a function/(x) defined on an interval —L < x < L, where L is 
a positive number different from it. This is done by a change of variable that 
amounts to a change of scale on the horizontal axis. 

We introduce a new variable t that runs from -ji to Jt as x runs from - L to 
L. This is easy to remember as a statement about proportions: 


t _ x 
n~L' 


so 



and 



( 1 ) 


The function/(x) is thereby transformed into a function of t, 
f(x) = / ^ j = g(t), -n<t <n, 


and if we assume that/(x) satisfies the Dirichlet conditions, then so does g(t). 
We can therefore expand g(t) in a Fourier series of the usual form. 


1 

g(t) = —a 0 + ^(a„ cos nt + b n sin nt ), 


( 2 ) 
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where we use the familiar formulas for the coefficients, 

n n 

a n = — \g(t)cosntdt and b n =—\g(t) sin ntdt. (3) 

n J n J 


Having found the expansion (2), we now use (1) to transform this back into a 
solution of our original problem, namely, to find an expansion of f(x) on the 
interval - L < x < L: 


, 1 'srV nnx , 

j(x) = — a 0 + \ a n cos-+ b n sm 


nnx \ 

IT Y 


(4) 


Of course, we can also transform formulas (3) into integrals with respect to x, 

L L 

a„ = j*/(x)cos^^dx and b n = jf(x)sin l ^^dx. (5) 


-L 


-L 


We can use formulas (5) directly if we wish to do so, but changing the vari¬ 
able to t usually makes the work easier because it simplifies the calculations. 


Example 1. Expand f{x) in a Fourier series on the interval -2 < x < 2 if 
f(x) = 0 for -2 < x < 0 and/(x) = 1 for 0 < x <2. 

Here we introduce t by writing 


t _ x 

7t 2 7 



and 


It 

x = —. 


71 


Then g(t) = 0 for -jt < t < 0 and g(t) = 1 for 0 < t < n, and we have 


a 0 - - 


u n 

jodt + jldt 


= 1 ; 


n J 


cosnt dt = 0, n> 1; 


n 

b n = — f sin nt dt = — [1 - (-1)"]- 
n J nn 




Fourier Series and Orthogonal Functions 


321 


The last of these formulas tells us that 



We therefore have 



sin (2?; -l)f 
2h-1 ' 


so the desired expansion is 



Further, we know that this series converges to the periodic extension of 
f(x) [with period 4] at all points x except the points of discontinuity x = 
0, ±2, ±4,..., and at these points it converges to the sum 1/2, which is the 
average of the two one-sided limits. 


Problems 

1. For the function defined by 

f(x) = -3, -2 < x < 0 and f(x) -3,0<x<2, 

write down its Fourier expansion directly from the example in the text, 
without calculation. 

2. Find the Fourier series for the functions defined by 

(a) f(x) - 1 + x, -1 < x < 0 and f(x ) = 1-x, 0 < x < 1; 

(b) f(x) = \x\, -2<x <2. 

3. Show that 



0 <x <L. 


4. Find the cosine series for the function defined on the interval 0 < x < 1 


1 

by f(x) - x * 1 2 3 - x + —. (In the context of Problem 9 below, this function is 


6 
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the Bernoulli polynomial B 2 (x), and the series found here is the simplest 
special case of the expansion in Problem 10.) 

5. Find the cosine series for the function defined by 


f(x) - 2, 0 < x < 1 and f(x ) = 0,1 < x < 2. 


6. Expand/(x) = cos nx in a Fourier series on the interval -1 < x < 1. 

7. Find the cosine series for the function defined by 



8. (This problem and the next are necessary preliminaries for the Fourier 
series problem that follows them, and this in turn is aimed at obtaining 
the remarkable formulas in Problem 11.) Since 


for x * 0, and this power series has the value 1 at x = 0, the reciprocal 
function x/(e x -1) has a power series expansion valid in some neighbor¬ 
hood of the origin if the value of this function is defined to be 1 at x = 0: 



0 


o 


The numbers B„ defined in this way are called Bernoulli numbers and 
play an important role in the theory of infinite series 10 . Evidently B 0 = 1. 

(a) By writing 



r x i 

x x e +1 


x x e 


2 2V-1 


+ — 


10 For instance, it can be proved that the power senes expansion of tan x is 



See Appendix A.18 in the Simmons book mentioned in footnote 4. 
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and noticing that the second term on the right is an even function, 
1 

conclude that B, = — and = 0 if n is odd and >1. 

i 2 n 

(b) By writing (*) in the form 



By 

1! 


x + 






3! 


= 1 


and multiplying the two power series on the left, conclude by 
examining the coefficient of x nA that 


v°y 


Bq + 


Kb 


Bi + 


v 2 y 


Bo + • • • + 


K n ~b 


Bn -1 - 0 


n 


for n > 2, where 


Kb 


is the binomial coefficient n\/[k\(n - fc)!] 


(c) By taking n- 3, 5, 7, 9,11 in (**), show that 


B 2 = 


1 

6' 





Bio — 


_5_ 

66 ' 


From the recursive mode of calculation, all the Bernoulli numbers 
can be considered as known (even though considerable labor may 
be required to make any particular one of them visibly present) and 
all of them are rational. 

9. The Bernoulli polynomials B 0 (x), B,(x), B 2 (x),... are defined by the resulting 
coefficients in the following product of two power series (see the pre¬ 
ceding problem): 


e 


X t 





A o y 


Z B n (x) t „ 

ft! 


(a) Show that B n (x) is a polynomial of degree n that is given by the 
formula 


B„(x) = 




vOy 


B 0 x n + 


f ft A 


Kb 


B 1 x n - 1 +--- + 


f ft ^ 


vH-ly 


B m _ iX + 




\ n j 


B„. 
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(b) Show that B„( 0) = B n for n > 0, and by using (**) in the preceding 
problem, show that also B„(l) = B n for n > 2. 

(c) Show that 


B'„ +1 (x) = {n + l)B„(x). 


and deduce from this that 


X 

B n +i(x) = B n+ 1 + (n +1) J B n (t) dt 


and (if n > 1) 


J B n {x)dx = 0. 


(d) Show that 


1 1 

B 0 (x) = l, Bfx) = x —, B 2 (x) = x 2 -x + — r 

2 6 

3 1 T 1 

B 3 (x) = x 3 — x 2 + — x, BJx) = x 4 -2x 3 + x 2 -. 

w 2 2 30 


10. Show that the cosine series for the Bernoulli polynomial B 2n (x) on the 
interval 0 < x < 1 is 


n+ i 2(2 n)l coslknx 
* 2n ' 


11. Use the expansion in Problem 10 to show that 


' ^ = (_i y +1 ^ j 
‘n 2p { ’ 2(2 p)\ 


.2 V 


where p is a positive integer. Use the results of Problem 8 to obtain the 
special sums corresponding to p = 1, 2, 3,4, 5: 





Fourier Series and Orthogonal Functions 


325 


^ 1 _ k 2 1 _ 7I 4 Vjh 1 _ 7I 6 

4^“T' O' Zj^ _ 945' 

“ i 8 ® -i 10 

1 _ 71 ^ 1 TC 

4^-9450' 4 ^- 93555 ■ 

These discoveries are all due to Euler. 11 


37 Orthogonal Functions 

A sequence of functions 0 n (x), n = 1, 1, 3,..., is said to be orthogonal on the 
interval [a, b] u if 



for mjtn, 
for m = n. 


( 1 ) 


For example, the sequence 

0j(x) = sin x, 0 2 (x) = sin 2x,..., 0„(x) = sin nx,... 
is orthogonal on [0, it] because 


j0,„(x)0„(x)dx = J 
0 0 


sin mx sin nx dx 



[cos(m -n)x- cos (m + n)x\dx< 


= 0 
_ n 
~2 


for m * n r 
for m = n. 


We pointed out in Section 33 that the sequence 

1, cos x, sin x, cos 2x, sin 2x,... (2) 


11 For more information on the background of these formulas, see the article by Raymond 
Ayoub, "Euler and the Zeta Function," American Mathematical Monthly, vol. 81 (1974) 
pp. 1067 -1086. 

12 As usual, this notation designates the closed interval a <x <b. 
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is orthogonal on [—71, it] but it is not orthogonal on [0, it] because 


J lsinx dx = 2^0. 

o 

In the preceding sections of this chapter the trigonometric sequence (2) was 
used for the formation of Fourier series. During the nineteenth and early 
twentieth centuries many mathematicians and physicists became aware 
that one can form series similar to Fourier series by using any orthogonal 
sequence of functions. These generalized Fourier series turned out to be 
indispensable tools in many branches of mathematical physics, especially 
in quantum mechanics. They are also of central importance in several major 
areas of twentieth century mathematics, in connection with such topics as 
function spaces and theories of integration. 13 

The formula for the generalized Fourier coefficients is particularly simple 
if the integral (1) has the value 1 for m - n. In this case the functions 0„ (x) 
are said to be normalized, and {0„ (x)} is called an orthonormal sequence. On the 
other hand, if 



dx = a„ * 1 


in (1), then it is easy to see that the functions 


<t> n (x) = 


0„(x) 



are orthonormal, that is. 


b 



(x)<t>„(x)dx 


a 



for m * n, 
for m = n. 


For example, since 


K K 

j\dx = 2k, j " 


cos 2 nx dx = n, sin 2 nx dx = n 


(3) 


(4) 


13 See, for example, the excellent book by Bela Sz.-Nagy, Introduction to Real Functions and 
Orthogonal Expansions, Oxford University Press, 1965. 




Fourier Series and Orthogonal Functions 


327 


for n> 1, the orthonormal sequence corresponding to 
sequence (2) is 

1 cosx sinx cos2x sin2x 

-Jin' yjn ' \fn ' ~Jn ' ~Jn 

Now let |(|)„(x)} be an orthonormal sequence of functions on [a, b] and sup¬ 
pose that we are trying to expand another function/(x) in a series of the 
form 


the orthogonal 


(5) 


/(x) = flql), (x) + n 2 <|> 2 (x) + ■ ■ ■ + a n $ n (x) + ■ ■ ■. (6) 

To determine the coefficients a„ we multiply both sides of (6) by <|>„(x). 
This gives 


/(x)4>»(x) = «i<l>i(x)tl>„(x) + ■ ■ ■ + a^M 2 + ■ ■ v (7) 

where the terms not written contain products (|) m (x)<|)„(x) with m * n. If we 
assume that term-by-term integration of (7) is valid, then by carrying out 
this integration and using (3) we find that most of the terms disappear and 
all that remains is 



a n [if n {x)fdx = a n , 


so 



( 8 ) 


In deriving formula (8) for the coefficients in the expansion (6), we made 
two very large assumptions. First, we assumed that the function/(x) can be 
represented by a series of the form (6). Second, we assumed that the term- 
by-term integration of the series (7) is permissible. Unfortunately, we have 
no reason whatever—apart from wishful thinking—for believing that either 
assumption is legitimate. To express this somewhat differently, we have no 
guarantee at all that the series (6) with coefficients defined by (8) will even 
converge, let alone converge to the function/(x). Nevertheless, the numbers 
(8) are called the Fourier coefficients of/(x) with respect to the orthonormal 
sequence {4>„(x)}, and the resulting series (6) is called the Fourier series of /(x) 
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with respect to {(|)„(x)}. 14 When these ideas are applied to the orthonormal 
sequence (5), they yield the ordinary Fourier series as described in the pre¬ 
ceding sections (see Problem 2 below). 

We also point out, as we did in Section 33, that the term-by-term integra¬ 
tion of (7) that leads to (8) is legal if the functions are continuous and the 
series is uniformly convergent. However, in the next section formula (8) will 
be obtained in an entirely different manner, having nothing to do with uni¬ 
form convergence. It will then be clear that there is no need to feel uneasy 
because formula (8) seems to have been derived by faulty reasoning. The 
truth is, that we can use whatever reasoning we please as motivation for the 
definitions of the Fourier coefficients and Fourier series, and we then turn to 
the problem of discovering conditions under which the Fourier series (6) is a 
valid expansion of the function/(x). 

Most orthogonal sequences of functions are obtained by solving differen¬ 
tial equations, as suggested in the following example. A broader discussion 
of this topic is given in Section 43. 


Example 1. Use the differential equation y" + \y = 0, or equivalently 
y" = -Xy, to show that the trigonometric sequence (2) is orthogonal on [-it, jt]. 

Let m and n be positive integers. If y m = sin mx or cos mx and y n = sin nx 
or cos nx, then 


y" m = -m 2 y m and y" = -« 2 y„. 

If the first equation is multiplied by y„, the second by y m , and the result¬ 
ing equations are subtracted, the result is 

y„y m - y„,y n = (n —tn )y, n y n . 

We now notice that the left side of this is the derivative of y„y,'„ - y m y(, so 
integrating from -k to k gives 


71 

{y n y'm - ymy'n)J_ K = (n 2 - m 2 ) Jy,„y„ Ax. (9) 

-71 


The function y„y' m - y m y'n is periodic with period 2 ji and therefore has 
the same values at -jr and k, so the left side of (9) is zero. This yields the 
orthogonality property 


yjmy, 


dx = 0, 


14 Some writers make consistent use of the terms generalized Fourier coefficients and generalized 
Fourier series. We prefer to simplify the terminology by omitting the adjective "generalized," 
and to rely on the context to tell us whether we are dealing with generalized or ordinary 
Fourier series. 
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except in the case m = n. In this case, however, the relevant integral is 
easy to evaluate: 


7t 


I 


sinnxcosnxdxm- — sm 2 nx =0. 
2 n 


All that remains is to notice that the function 1 in the sequence (2) is 
orthogonal to all the others, that is, 



7t 


for every n, and this completes the argument. 

There is a very suggestive analogy between Fourier series and vectors that 
should be mentioned here. Let us briefly consider ordinary three-dimensional 
Euclidean space. In this space i, j, k are familiar mutually perpendicular unit 
vectors in the coordinate directions, and other vectors can be written in the 
form 


A = flji + + a 3 k 


and 


B = Fji + b 2 ) + b 3 k. 


Let us denote the "dot product" A • B of A and B by the symbol (A, B), so that 


(A,B) = a 1 b 1 + a 2 b 2 + a 3 b 3 . 


( 10 ) 


In the present context we prefer to call this quantity the inner product of A 
and B, and our purpose is to point out that this inner product is closely con¬ 
nected with the most important geometric features of the space. 

First, two vectors A and B are orthogonal (or perpendicular) if their inner 
product is zero, that is, if 


(A,B) = a 1 b 1 + a 2 b 2 + a 3 b 3 = 0. 


( 11 ) 


Next, the inner product underlies the concept of the norm, or length, of a vec¬ 
tor A: if we denote the norm by || A||—a symbol that resembles, but differs 
from, the absolute value sign—then 
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This norm in turn gives rise to the concept of the distance between any two 
points in the space, or equivalently, the distance between the tips of any two 
vectors. 


d (A,B) = || A — B ||. (13) 

As our final bit of review, we recall that if u,, u 2 , u 3 are any three mutually 
orthogonal unit vectors, then every vector V can be expressed in the form 

V = oqUj + a 2 u 2 + a 3 u 3 , (14) 

where cq a 2 , a 3 are constants. In order to determine these constant coeffi¬ 
cients for a given vector V, we form the inner product of both sides of (14) 
with u k , where k = 1,2, or 3. This yields 

(V, u^) = cq (u^Uj) + a 2 (u 2 , u*) + <x 3 (u 3 ,u*); 

and since the vectors u v u 2 , u 3 are mutually orthogonal and have length 1, 
the sum on the right collapses to a single term, 

(V, u*) = a k . 

The formula for the coefficients is therefore 

«* = (V, uf (15) 

Equations (14) and (15) should be compared with (6) and (8), because their 
meanings are very similar. In essence, the cq are the "Fourier coefficients" of 
the vector V, and (14) is its expansion in a "Fourier series." 

In the case of genuine Fourier series, we work with functions defined on 
an interval [a, b] instead of with vectors. We speak of a "function space" 
instead of a three-dimensional "vector space." This function space is 
infinite-dimensional, in the sense that we need an infinite orthonormal 
sequence to represent an arbitrary function. Life is somewhat more compli¬ 
cated in this infinite-dimensional space than it is in the three-dimensional 
space described above. First, it turns out that only special kinds of ortho¬ 
normal sequences are capable of representing "arbitrary" functions. And 
second, it is necessary to introduce restrictions that remove the vagueness 
from the expression "arbitrary function" and precisely define the class of 
functions that are to be represented by their Fourier series. We begin this 
precise discussion in the next few paragraphs, and continue it in the next 
section. 

The function space we consider is denoted by R and consists of all func¬ 
tions/^) that are defined and Riemann integrable on the interval [a, b]. Since 
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the inner product (10) is the sum of products of components, and since the 
values of a function can be thought of as its components, it is natural to define 
the inner product (f, g) of two functions in R by 

b 

(f,g) = \f(x)g(x)dx. (16) 


Clearly, 


(A +fug) = (fug) + (f 2 ,g), 

(cfg) = c(f,g) and {f,g) = ( g,f ). 

With (11) as our guide, we say that/and g are orthogonal if their inner product 
is zero, that is, if 


(fig) = o. 


This is precisely the meaning of orthogonality as given in Section 33, 


f(x)g(x)dx = 0. 


By the definition at the beginning of this section, an orthogonal sequence in 
R is a sequence with the property that each function is orthogonal to every 
other and no function is orthogonal to itself. Continuing the analogy, the 
norm of a function/is defined by 



(17) 


so that 


ll/ll 2 =m 


A function/is called a null function if 


|/|| = 0 or, equivalently, if 


b 

J* [f(x)] 2 dx = 0. 


a 
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A null function need not be identically zero. For example, iif(x) = 0 on [-ir it] 

1 1 

except at the points x = 1 , — , but f(x) = 1 at these points, then/is a null 

function. In the present context it is convenient to consider a null function 
as being essentially equal to zero, so that two functions are considered to 
be equal if their difference is a null function. With this understanding, the 
norm has the simple properties 




> 0 , 


||/|| = 0 if and only if/ = 0. 
Two properties that are not so simple are 

\<f'g)\ * ll/ll hi 


(18) 


(19) 


and 


ll/+slMI/IMIsll- (20) 

The inequality (19) is called the Schwarz inequality. By using (16) and (17), it 
can be written out as follows [in the form (f, g) 2 < ||/|| 2 ||y|| 2 ]: 


I f(x)g(x)dx 


b b 

^jlf(x)fdx-j[g(x)] 2 dx. 

a a 


The inequality (20) is called the Minkowski inequality ; its written-out form is 


b 


\[f(x) + g(x)fdx 



The integral versions of these inequalities have a formidable appearance, 
and one might think that probably they cannot be established except by the 
use of complicated reasoning. In fact, however, there exists a simple but inge¬ 
nious proof of (19) which we ask readers to think through for themselves 
(Problem 3 below); and (20) follows quite easily from (19) by an argument that 
we give here. Thus, by Schwarz's inequality we have 
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11/+ g\\ 2 = (f+ g,f + g) = iff) + 2(/, g) + (g,g) 


ll/ll 2 + 2{f,g) + \\g\\ 


2 


* ll/ll 2 + Wg)\ + IHI 2 


< 


ll/ll 2+ 2 1|/|| \\g\\ + \\g\\ 


2 



and we now obtain (20) by taking square roots. 

By using the concept of the norm of a function, we are now able to define 
the distance d(f,g) between two functions/and g in R: 


d (f'g) = \\f-g\\= \[f(x)-g(x)fdx 


( 21 ) 


a 


We also speak of d (f, g) as the distance from/to g, or the distance of g from/ 
It is easy to see from (18) and (20) that distance has the following properties: 


d(f, g) > 0, and d(f, g) = 0 if and only if/ = g; 


d(f>g) = d(g,f) [; symmetry ]; 


d(f, g) < d(f, h ) + d(h, g) [triangle inequality]. 


A space (of vectors, functions, or any objects whatever) with a distance func¬ 
tion possessing these properties is called a metric space. With the understand¬ 
ing that functions in R are considered to be equal if they differ by a null 
function, R is a metric space whose structure we continue to investigate in 
the next section. 

NOTE ON MINKOWSKI. At the age of 18 the Russian-German mathemati¬ 
cian Hermann Minkowski (1864-1909) won the Grand Prize of the Academy 
of Sciences in Paris for his brilliant research on quadratic forms, starting 
from a problem about the representation of an integer as the sum of five 
squares. This work later led to the creation of a whole new branch of number 
theory now called the Geometry of Numbers, which in turn is based on his 
highly original ideas about the properties of convex bodies in /(-dimensional 
space. In this connection he introduced the abstract concept of distance, 
analyzed the notions of volume and surface, and established the important 
inequality that bears his name. In the years 1907-1908 Minkowski became 
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the mathematician of relativity by geometrizing the new subject. He created 
the concept of four-dimensional space-time as the proper mathematical set¬ 
ting for Einstein's essentially physical (and nonmathematical) way of think¬ 
ing about special relativity. In a now-famous lecture of 1908 he began with a 
sentence that is not easily forgotten: "From now on space by itself, and time 
by itself, are doomed to fade away into mere shadows, and only a kind of 
union of the two will retain an independent existence." 

NOTE ON SCHWARZ. Hermann Amadeus Schwarz (1843-1921), a pupil 
of Weierstrass whom he succeeded in Berlin, made substantial contributions 
to the theory of minimal surfaces in geometry and to conformal mapping, 
potential theory, hypergeometric functions, and other topics in analysis. 
In conformal mapping, he rescued and rigorously nailed down some of 
Riemann's very important but rather intuitive discoveries, especially the 
basic Riemann mapping theorem. In minimal surfaces, he gave the first rig¬ 
orous proof that a sphere has a smaller surface area than any other body 
of the same volume. He also discovered and proved the "pedal triangle" 
theorem of elementary geometry: In any acute-angled triangle, the inscribed 
triangle with smallest perimeter is the one whose vertices are the three feet 
of the altitudes of the given triangle. 15 


Problems 

1. One of the important consequences of the orthogonality properties of 
the trigonometric sequence (2) [namely, equations (4) in this section 
and (2), (5), (6), (8) in Section 33] is Bessel’s inequality: If f(x) is any func¬ 
tion integrable on [-ir, k], its ordinary Fourier coefficients satisfy the 
inequality 


1 

2 


al + 


^Ja l + hi 

k =1 


)<-f[/(x )] 2 

71 J 


dx. 


o 


Prove this by the following steps: 
(a) For any n > 1, define 



n 

^ ( a k cos kx + bk sin kx) 

k =1 


15 For details, see Chapter 5 of H. Rademacher and O. Toeplitz, The Enjoyment of Mathematics, 
Princeton University Press, 1957; or R. Courant and H. Robbins, What Is Mathematics ?, Oxford 
University Press, 1941, pp. 346-51. 
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and show that 


1 I* 1 

- J f(x)s n (x)dx = -«o + + bk)- 


(b) By considering all possible products in the multiplication of s n (x) by 
itself, show that 


1 

n 



^-al + '*£(^al + bi). 

k -1 


(c) By writing 


- [ [f(x)-s n (x)] 2 dx 
71 J 


- f [f(x)] 2 dx- — f f(x)s„(x)dx +— [[s n (x)] 2 dx 
K J 71 J 71 J 

— 7t —71 —71 

7U 

^ | lf{x)fdx -1 + l)2) r 


conclude that 



fc=l 


) < - f [/(x )] 2 

71 J 


dx, 


and from this complete the proof. 

Observe that the convergence of the series on the left side of (*) 
implies the following corollary of Bessel's inequality: If a„ and b n are 
coefficients of/(x), then a n -> 0 and b n 0 as n -> °°. 

2. In the case of the orthonormal sequence (5), verify in detail that the 
Fourier coefficients (8) are slightly different from the ordinary Fourier 
coefficients, but that the Fourier series (6) is exactly the same as the 
ordinary Fourier series. 

3. Prove the Schwarz in equality (19). Hint: If ||y|| / 0, then the function 
F(a)= ||/+ay|| 2 is a second degree polynomial in a that has no negative 
values; examine the discriminant. 
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4. A well-known theorem of elementary geometry states that the sum 
of the squares of the sides of a parallelogram equals the sum of the 
squares of its diagonals. Prove that this called parallelogram law is true 
for the norm in R: 

2 II/II 2 + %II 2 = ll/+sll 2 + ll/-£ll 2 - 

5. Prove the Pythagorean theorem and its converse in R:f is orthogonal to g 
if and only if \\f-g\\ 2 = \\f\\ 2 + ||y|| 2 . 

6. Show that a null function is zero at each point of continuity, so that a 
continuous null function is identically zero. 


38 The Mean Convergence of Fourier Series 

Consider a function/(x) and a sequence of functions p„(x), all defined and 
integrable on the interval [a, b\. There are different ways in which p„(x) can 
converge to /(x), and these are best understood in terms of the problem of 
approximating/(x) by p„ (x). 

If we try to approximate/(x) by p n {x), then each of the numbers 

l/(x) - P„(x) | and \f{x) - p„(x)] 2 (1) 

gives a measure of the error in the approximation at the point x. It is clear 
that if one of these numbers is small, then so is the other. The usual definition 
of convergence amounts to the statement that the sequence of functions p„(x) 
converges to the function/(x) if for each point x either of the expressions (1) 
approaches zero as n -h> °o. This is the familiar concept used in Sections 33 to 
36, and for obvious reasons it is called pointwise convergence. 

On the other hand, we might prefer to use a measure of error that refers 
to the whole interval [a, b] simultaneously, instead of point by point. We can 
obtain such a measure by integrating the expressions (1) from a to b, 

b b 

|/(x)-p„(x)|dx and \[f(x)-p n (x)] 2 dx. 


The second integral here is a better choice than the first, for two reasons: it 
avoids the awkward absolute value sign in the first integral; and the expo¬ 
nent 2 makes many of the necessary calculations very convenient to carry 
out, as we will see below. The measure of error we adopt is therefore 
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b 


j[f(x)-p n (x)fdx. 


( 2 ) 


a 


This quantity is called the mean square error. The terminology is appropri¬ 
ate because if the integral (2) is divided by b - a, the result is exactly the 
mean value of the square error \f(x) - p„(x)] 2 . If (2) approaches zero as n -* 
the sequence {p n (x)} is said to converge in the mean to f(x), and this concept is 
called mean convergence. We sometimes symbolize this mode of convergence 
by writing 


/(x) = l.i.m.p„(x), 


where "l.i.m." stands for "limit in the mean." Our discussion in the rest of 


this section will show that in the case of Fourier series mean convergence is 
much easier to work with than ordinary pointwise convergence. 

We assumed at the beginning that the functions/(x) and p n (x) belong to the 
function space R described in the preceding section. We now point out that 
the mean square error (2) is precisely the square of the norm of/- p„ in R, 



(3) 


The mean convergence of p„(x) to/(x) is therefore completely equivalent to the 
convergence of the sequence {p, ,} to the limit/in the metric space R, namely. 


d(f,p„) = \\f-Pn\\ 0 as n -> 


As indicated here, we will often use/and p n as abbreviations for/(x) and p„(x), 
in order to simplify the notation. 

We now come to the main business of this section. Let {(|)„(x)} be an ortho¬ 
normal sequence of integrable functions on [a, b], so that 



for m * n, 
for m = n. 


(4) 


We consider the first n of these functions, 


4>i(x), (|> 2 (x),..., 4>„(x), 


(5) 
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and we seek to approximate a given integrable function/(x) by a linear com¬ 
bination of the functions (5), 


P«(x) = bM x ) + M> 2 (*) + ■■■ + b„ 4>„(x). 


Our purpose is to minimize the mean square error (2), 

b b 

E„ =|[/-p„] 2 dx = |[/-(fc 1 ( l ) l +--- + b„$ n )] 2 dx, (6) 

a a 

by making a suitable choice of the coefficients by..., b n . 

Our first step is to expand the term in brackets in (6), which yields 

b b 

E n = J/ 2 dx - 2 J(b 1 (|)i + • • • + b n §„) / dx 

a a 

b 

+ H-h &„<|>„) 2 £ix. (7) 


If the Fourier coefficients of/with respect to the orthonormal sequence {cf> fc } 
are denoted by 


flic 



dx, 


a 


as in Section 37, then the second integral in (7) is 



+ • • • + b „§„)/ dx - afi + —h a n b„. 


The third integral in (7) can be written 



1 H-h b n § n )(£’l4 > l ^- b„ty„ ) dx 


a 


b 

= J* (bf j/ + • • • + bfyl +---)dx 

a 

= bl + --- + bl, 


Fourier Series and Orthogonal Functions 


339 


where the second group of terms "+ ■ • •" contains products cf), <|), with i * j and 
the final value results from using (4). These considerations enable us to write 
the mean square error (7) as 


« n n 

E n = J f 2 dx - 2^ji k b k + 


If we now notice that 


-2 a k b k + b k — -a 2 + (b k - a k ) 2 , 


then the formula for E n takes its final form, 

b n n 

E n =\f 2 dx- Jal + ^\b k - a k f. 

a k =1 k =1 


( 8 ) 


(9) 


Formula (9) for the mean square error E n has a number of important conse¬ 
quences that follow by very simple reasoning. First, the terms (b k - a,) 2 in (9) 
are positive unless b k = a k , in which case they are zero. Therefore the choice of 
the b k that minimizes E n is obviously b k = a k , and we have 


Theorem 1. For each positive integer n, the nth partial sum of the Fourier series of 
f namely, 


n 

'^^Clkfyk - a l§l - \~ a n§n/ 

k =1 


b 

gives a smaller mean square error E n = J*(/ - p n ) 2 dx than is given by any other lin- 

a 


ear combination p n = b l § 1 + • ■ • + b tl $ ir Further, this minimum value of the error is 


min E n 



( 10 ) 


Formula (6) tells us that we always have E u > 0, because the integrand in (6), 
being a square, is nonnegative. Since E n > 0 for all choices of the b k , it is clear 
that the minimum value of E n (which arises when b k = a k ) is also > 0. Therefore 
(10) implies that 
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By letting n -> °° we at once obtain 

Theorem 2. If the numbers a„ = /)>„ dx are the Fourier coefficients of f with 

respect to the orthonormal sequence {(])„}, then the series ^T^al converges and satis¬ 
fies Bessel’s inequality, 



( 11 ) 


Since the «th term of a convergent series must approach zero. Theorem 2 
implies 

Theorem 3. If the numbers a n = I /()>„ dx are the Fourier coefficients of f with 

J n 

respect to the orthonormal sequence {cf>„}, then a n —> 0 as n -> 

Theorems 2 and 3 are obtained for ordinary Fourier series in Problem 37-1. 
Here they are seen to be true for generalized Fourier series with respect to 
arbitrary orthonormal sequences. 

For applications it is important to know whether or not the Fourier series 
of/is a valid expansion of/in the sense of mean convergence. This is equiva¬ 
lent to asking whether or not the partial sums of the Fourier series of/con¬ 
verge in the mean to/ that is, whether or not 


n 

/ = l.i.m.V a k i? k . 

n-> oo < 4 


k =1 


( 12 ) 


In view of Theorem 1 it is evident that we do have a valid expansion of/if 
and only if 

min E n -» 0 as n -> 

and by formula (10) we see that this happens if and only if Farseval’s equation 
holds: 



flit =0. 


We summarize these observations in the following theorem. 
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Theorem 4. The representation off by its Fourier series, namely, 

f= Mb + n 2§2 + ■ ■ ■ + + ■ • V (13) 

is valid in the sense of mean convergence if and only if Bessel’s inequality (11) 
becomes Parseval’s equation, 



(14) 


If a Fourier expansion of the form (13) is valid (in the sense of mean conver¬ 
gence) for every function/(x) in R, then the orthonormal sequence j (<]>., (x)) is 
said to be complete. A complete sequence, then, is a sequence |(|)„} that can be 
used for mean square approximations of the form (12) for arbitrary functions 
/in R. It can be proved that the trigonometric sequence 

1 cosx sinx cos2x 

-fin' fn ’ fn ’ fn 

is complete on [-n, ji]. 

Remark 1. The proof of the theorem just stated 
sequence (15) is long and would take us much too far afield. 16 However, if 
we recall Problem 2 in Section 37, then we see that this theorem immediately 
yields the following major conclusion, which can be interpreted as sweeping 
away all the difficulties that arise in the theory of pointwise convergence for 
Fourier series. 


sin2x 

-fn 


(15) 


about the trigonometric 


Theorem 5. Iff(x) is any function defined and integrable on [-it, ji], then fix) is rep¬ 
resented by its ordinary Fourier series in the sense of mean convergence, 


/(x) = -«o + 


00 

cos nx + b„ sin nx ), 

«=i 


(16) 


where the a n and b n are the ordinary Fourier coefficients of fix). 


16 The basic tools for the proof we have in mind are two major theorems of classical analysis, 
Fejer's summability theorem and the Weierstrass approximation theorem. 
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To appreciate the clean simplicity of this statement, it helps to recall from our 
previous work that this representation theorem is false if (16) is interpreted 
in the sense of pointwise convergence; further, the representation even fails 
for some continuous functions. 

Remark 2. In Problem 6 below we ask the student to show that if we spe¬ 
cialize to the interval [-ji, ji] and use the ordinary Fourier coefficients, then 
Parseval's equation (14) takes the form 

1 ^ 1 °° 

-jlf(x)fdx = -al + ' S ^{a 2 n + bl). (17) 

—71 1 

The function/(x) in this equation is assumed to belong to R, that is, to be 
Riemann integrable on [-ji, ir] and for any such function its square [fix)] 2 is 
also automatically integrable. It therefore follows from (17) that for this func¬ 
tion the Fourier coefficients a 0 , a u b 1 a 2 , b 2 ,... have the property that the series 

+ b;f converges. Of course, we already knew this from Problem 1 in 

Section 37. 

However, if the Riemann integral is replaced by its more powerful cousin 
the Lebesgue integral, then this statement has a converse that was proved by 
F. Riesz and E. Fischer in 1907. The famous Riesz-Fischer theorem, one of the 
great achievements of the Lebesgue theory of integration, states that given 

any sequence of numbers a 0 , a v b 1 a 2 , b 2/ ... such that the series 2>’ +b ” } 

converges, there exists a unique square-integrable function fix) with these 
numbers as its Fourier coefficients. 

It is customary to use the symbol L 2 to denote the space of functions fix) 
that are square-integrable on [-ir, ir] in the sense of Lebesgue, where as usual 
two functions are considered to be equal if they differ by a null function. 17 
When Parseval's equation (17) and the Riesz-Fischer theorem are taken 
together, we see from this discussion that they give a very simple charac¬ 
terization of the functions in L 2 in terms of their Fourier coefficients. It is 
remarkable that no other important class of functions has a characterization 
of comparable simplicity and completeness—a fact that delights the souls of 
mathematicians. 

NOTE ON PARSEVAL. Marc-Antoine Parseval des Chenes (1755-1836), 
member of an aristocratic French family and ardent royalist, poet, and ama¬ 
teur mathematician, managed to survive the French Revolution with his 


17 It should be pointed out that L 2 contains R and many other functions as well, and that when¬ 
ever the Lebesgue integral is applied to a function in R, it yields the same numerical result 
as the Riemann integral. 



Fourier Series and Orthogonal Functions 


343 


head still on his shoulders, but was imprisoned briefly in 1792 and luckily 
fled the country when Napoleon ordered his arrest for publishing poetry 
attacking the regime. He published very little mathematics—and none of 
any distinction—but this little included (in 1799) a rough statement that 
only slightly resembles Parseval's equation as it is known to mathematicians 
today throughout the world... and for this his name is immortal. 


Problems 

1. Consider the sequence of functions /„(x), n - 1,2, 3,..., defined on the 
interval [0,1] by 


f 0 , 


/»(*) = 



0 < x < 1 jn, 

1 Jn<x< 2/m, 
2/m < x < 1. 


(a) Show that the sequence \f„{x)\ converges pointwise to the zero func¬ 
tion on the interval [0,1]. 

(b) Show that the sequence {/„(*)} does not converge in the mean to the 
zero function on the interval [0,1]. 

r i 

2. Consider the following sequence of closed subintervals of [0,1]: 0, — 


-4 


n 1 

1 1 

1 3 

3 i 

n 1 

' °'4 

4 2 

2'4 ' 

4 4 ' 

V 


,..., and denote the nth 


subinterval by I n . Now define a sequence of functions/,, (x) on [0,1] by 


fn ( X ) 


for x in t„, 
for x not in I„. 


(a) Show that the sequence {/„(x)} converges in the mean to the zero 
function on the interval [0,1]. 

(b) Show that the sequence {/„(x)} does not converge pointwise at any 
point of the interval [0,1]. 

3. Obtain the formula h k = a k from both (8) and (9), by using the fact that 
dE n /db k - 0 when E„ has a minimum value. 

4. The function/(x) = 1 is to be approximated on [0, jt] by p(x) = b t sin x + 
b 2 sin 2x + b 3 sin 3x + b 4 sin 4x + b 5 , sin 5x in such a way that j/[l - p(x)] 2 dx 
is minimized. What values should the coefficients b k have? 
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5. The function/(x) = x is to be approximated on [0, it] by 
p(x) = b 1 sin x + b 2 sin 2x + b 3 sin 3x 


in such a way that I [x - p(x)] 2 dx is minimized. What values should 

J o 

the coefficients b k have? 

6. Show that Parseval's equation (14) has the form (17) when the ortho¬ 
normal sequence {cf>„ (x)[ is the trigonometric sequence (15). 

7. Obtain the sums 


z 


6 


and 



90 


by applying Parseval's equation in the preceding problem to the two 
Fourier series 


. sin 2x sin 3x 
x = 2 sin x-+- 


and 

2 oo 

x 2 = y + 4 Z ( - 1 ),,c v zx - 

[These series are found in Example 33-1 and Problem 35-10(a).] 

8. Use the method and results of Problem 7 to obtain the sum 

•ST' 1 _ n 6 

4^-945 

from the sine series for x 2 [Problem 35-10(b)]. 

9. Use the method and results of Problems 7 and 8 to obtain the sum 

Y 1 _ 7I 8 

4^7?” 9450 


from the cosine series for x 4 [Problem 35-12(a)]. 
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Appendix A. A Pointwise Convergence Theorem 

We divide the work of stating and proving the theorem into stages, for easier 
comprehension. 

1. Our first purpose is to obtain a convenient explicit formula for the differ¬ 
ence between a function and the nth partial sum of its Fourier series. This 
formula will enable us to prove pointwise convergence for a large class of 
functions that includes all the examples given in this chapter. 

To develop this formula, we begin by assuming only that f(x) is an inte¬ 
grate function of period 2 ji. The nth partial sum of its Fourier series is then 



k=l 


( 1 ) 


where 


n 




and 


By substituting (2) into (1) we obtain 



-n 



( 3 ) 


If we define the Dirichlet kernel by 



k =1 


( 4 ) 


then (3) can be put in the more compact form 


K 



( 5 ) 
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Putting u - t — x in (5) yields 


n-x 

s„(x) = — f(x + u)D n (u)du. 

n J 


( 6 ) 


By the definition (4), D n (u) has period 2n; and as a function of u,f(x + u) also 
has period 2 ji. Therefore the integral of f(x + u)D n (u) over any interval of 
length 2 n equals the integral over any other interval of length 2 it, and (6) can 
be written 


71 

s n (x) = — f(x + u)D n (u)du. 

K J 


(7) 


Since D„(-w) = D„(u), we can replace u by -u in (7) to obtain 


-71 

S n(x )=— \f(x-u)D„(u)du 

n J 

71 


1 

n 


K 

jf(x-u)D„(u)du, 

—71 


( 8 ) 


and adding (7) and (8) yields 


7U 

2 s n (x) = — [f(x + u) + f(x-u)]D n (u)du. 

n J 

—71 


The integrand here is an even function of u, so the integral from -ji to % is 
twice the integral from 0 to n, and we have 

71 

s n (x) = -\[f(x + u) + f(x-u)]D n (u)du. (9) 

n J 

o 

To bring/(x) into our discussion and put the difference s n (x) -f(x) into a con¬ 
venient form, we notice that 


1 

n 



1 

2' 
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since the terms cos ku in (4) integrate to zero. If we now multiply this by 2f(x) 
we obtain 


f(x) = -{2f(x)D n (u)du, (10) 

71 J 

0 


and subtracting (10) from (9) yields 


s n {x)-f(x) = - \[f(x + u) + f(x-u)-2f(x)]D n (u)du. (11) 

K J 

0 

This formula is our fundamental tool for studying the convergence of s n (x) 
to fix). 

2. At this point we need the following closed formula for the Dirichlet 
kernel (4), 


D n (u) = 


1 ^ 

—+ y cos ku 

^ Jc=l 


sin(« + \)u 
2sin 4-n 


if sin — u * 0 18 . This enables us to write (11) in the form 


1 n , ^ 

S„(x) - f(x) = - jg(w) sinf n + - J u du, 


where 


g(u) 


f(x + u) + f(x -u)- 2 f(x) 

1 ■ 
2 sin — u 

2 


( 12 ) 


(13) 


(14) 


Of course, g(u) is really a function of both u and x. However, we are going 
to be examining g(u) with x fixed and u variable, and this notation helps to 
avoid confusion. In view of (13), to prove that s n (x) ->• fix) as n -> we must 
prove that 


18 This formula can easily be proved by writing down the identity 2 cos A sin B = sin (A + B) 
sin (A - B) n times, with A = u, 2 u, 3 nu and B = u/2, and adding the results to obtain 
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lim 

n —>00 


71 



0 



u du = 0. 


(15) 


Our task is to give a rigorous proof of (15) with appropriate, understandable, 
and clearly stated assumptions about the behavior of the function/(x). 


3. As a preliminary to the proof of the main convergence theorem stated 
below, we need the following lemma. 

Lemma. Iffy (u) is integrable on the interval [0, it], then 


lim j*(|>(M)sinf n + ^ \udu =0. 


(16) 


Proof. By the addition formula for the sine, this integral can be broken up 
into 


K K 

J <K«) cos — 2 <-sin nudu + J<K«) sin — 


— u • cos nu du. 
2 


If we write 


and 


A, 



cos nu du 



■sinnudu, 


then the integral (16) is 


| (A +B„). 

1 

It is easy to see that A n is the nth coefficient in the cosine series for (|) (li) sin u, 

1 2 

and B n is the nth coefficient in the sine series for fy(n) cos —u. Since fy(u) is integra¬ 
ble, each of these functions is also integrable. It now follows from the corollary 
to Bessel's inequality stated at the end of Problem 37-1 that A n -*■ 0 and B n -^0 as 
n-*<*>, and the proof of (16) is complete. 
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4. In view of condition (15) and the lemma, all that remains is to formulate 
assumptions sufficient to guarantee that the function g(u ) defined by (14) is 
integrable on [0, it]. 

So far, we have only the general requirements that /(x) is integrable on 
[-it, it] and periodic with period 2it. We now make the further assumption 
that /(x) is piecewise smooth on [- it, it]. This means that the graph on [-it, it] 
consists of a finite number of continuous curves on each of which f"(x) exists 
and is continuous. It also means that the derivative exists at the endpoints of 
these curves, in the sense of 

lim /(* + -)-/(»> and i im /(*-")-/(*-) (17) 

u —>0+ u u-» 0+ —U 

In this way, the function/(x) is guaranteed to have a right derivative and a 
left derivative at every point x—including points of discontinuity—which 
we denote by /+(x) and f!_ (x). 

Of course, the function/(x) is allowed to have a finite number of jump dis¬ 
continuities on [-ir, it]. However, since the Fourier coefficients are not changed 
if/(x) is redefined at a finite number of points, we may assume without loss 
of generality that 


/(*) = 


f (x~) + f (x+) 
2 


(18) 


at every point x, whether/(x) is continuous at x or not. 

Our pointwise convergence theorem can now be stated as follows. 

Theorem. Iff(x) is piecewise smooth on ]-ir, it], is periodic with period 2it, and is 
defined at points of discontinuity by (18), then the Fourier series o//(x) converges to 
f(x) at every point x. 

5. To prove this theorem, let x be any fixed point. We wish to establish the 
correctness of (15), and in view of the lemma, it suffices to show that the 
function 


£(“) 


/(x + u) + f(x - u) - 2/(x) 

\ 

2 sin — u 
2 


(19) 


is integrable on [0, it]. It is clear that the only doubt about integrability arises 
1 

from the fact that sin —u = 0 when u - 0—for elsewhere in the interval, 
1 2 

sin — u is continuous and positive, and the numerator of (19) is certainly an 
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integrable function of u on [0, it]. We see from these remarks that g(u) will be 
integrable on [0, Jt] if we can show thatg(i;) approaches a finite limit as u -* 0+. 
By using (18) we can write 


f(x + u) + f(x-u)~ f(x-) - f(x+) 

2 sin — u 
2 


f(x + u)-f(x+) | f(x-u)-f(x-) 
u u 



sm —u 


2 


But as u 0+, (17) tells us that 

n* + u)-f{x+) ^ and f[x-u)-f{x-) 

u u 

and we know that 


1 

— u 

2 -> 1 . 


1 

sm — u 
2 


It therefore follows that 


g(u) -» fl(x)- f-(x), 


so g(u) is integrable on [0, it] and the proof is complete. 











Chapter 7 

Partial Differential Equations and 
Boundary Value Problems 


39 Introduction. Historical Remarks 

The theory of Fourier series discussed in the preceding chapter had its his¬ 
torical origin in the middle of the eighteenth century, when several mathema¬ 
ticians were studying the vibrations of stretched strings. The mathematical 
theory of these vibrations amounts to the problem of solving the partial dif¬ 
ferential equation 


a 


2 


d 2 y _ 0 2 IJ 
dx 2 dt 2 ’ 


( 1 ) 


where a is a positive constant. This one-dimensional wave equation has many 
solutions, and the problem, for a particular vibrating string, is to find the 
solution that satisfies certain preliminary conditions associated with this 
string, such as its initial shape, its initial velocity, etc. The solution then 
describes the subsequent motion of the string as it vibrates under tension. 
The equilibrium position of the string is assumed to be along the x-axis, and 
if y = y(x, t ) is the desired solution of (1), then for a fixed value of f>0 the 
curve y = y(x, t) gives the shape of the displaced string at that moment (see the 
dashed curve in Figure 52), and this shape changes from moment to moment. 

For the case of a string stretched between the points x = 0 and x = it, and 
then deformed into an arbitrary shape and released at the moment t- 0, 
Daniel Bernoulli (in 1753) gave the solution of (1) as a series of the form 

y = b l sin x cos at + b 2 sin 2x cos 2at +■■■. (2) 

It is easy to verify by inspection that a typical term of this series, 
b n sin nx cos nat, is a solution of equation (1). Further, every finite sum of such 
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y 



y = y(x, 0) 


0 


► 

x 


FIGURE 52 


terms is a solution, and the series (2) will also be a solution if term-by-term 
differentiation of the series is justified. 1 When t = 0, the series (2) reduces to 


y = b 1 sin x + b 2 sin lx + ■ ■ ■. 


This should give the initial shape of the string, that is, the curve y = y{x, 0) into 
which the string is deformed at the moment t = 0 when the string is released 
and the vibrations begin (see the solid curve in Figure 52). 

However, d'Alembert (in 1747) and Euler (in 1748) had already published 
solutions of the problem which, for the case stated above, have the form 



( 3 ) 


Here the curve y =/(x) is assumed to be the shape of the string at time t = 0; 
also, the function/(x) is assumed to be defined outside the interval [0, ji] by 
the requirement that it is an odd function of period 2 ji, that is. 


/(-x) = -f{x) and /(x + 2k)- fix). 


If we compare the solution of Bernoulli with that of d'Alembert and Euler, 
then we see at once that we ought to have 


f{x) = b 1 sin x + b 2 sin 2x + ■ ■ •, 


( 4 ) 


because this is what we get if the solutions (2) and (3) agree at time t = 0. 
Therefore, as a result of mathematically analyzing this physical problem, 
Bernoulli arrived at an idea that has had very far-reaching influence on the 


1 In Bernoulli's time no mathematicians had any doubt that infinite series of functions can be 
differentiated freely term-by-term. Such doubt was the product of a later, more skeptical, and 
more sophisticated age. 
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history of mathematics and physical science, namely, the possibility that a 
function as general as the shape of an arbitrarily deformed taut string can be 
expanded in a trigonometric series of the form (4). 

Both d'Alembert and Euler rejected Bernoulli's idea, and for essentially the 
same reason. It is clear on physical grounds that there is a great amount of 
freedom in the way the string can be constrained in its initial position. For 
example, if the string is plucked aside at a single point, then the shape will 
be a broken line (Figure 53(a)); and if it is pushed aside by using a circular 
object of some kind, then the shape will be partly a straight line, partly an 
arc of a circle, and partly another straight line, as in Figure 53(b). Is it reason¬ 
able to expect that the single "formula" or "analytic expression" (4) could 
represent a straight line on part of the interval [0 ,ji], a circle on another part, 
and a second straight line on still another part? To the mathematicians of 
that time (except Bernoulli) this seemed absurd. To d'Alembert the curve 
in Figure 53(b) would have represented three separate graphs of three dis¬ 
tinct functions, merely pieced together. To Euler it would have been a single 
graph, but of three functions rather than a single function. Both dismissed 
the possibility that such a graph could be represented by a single "reason¬ 
able" function like the series (4). The controversy bubbled on for many years, 
and in the absence of mathematical proofs, no one converted anyone else to 
his way of thinking. 




FIGURE 53 
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The more general form of a trigonometric series containing both sines and 
cosines, namely. 


f( x ) - ~ a o + 


00 

^ (a„ cos nx + b n sin nx), 

H=1 


( 5 ) 


arises naturally in another physical problem, that of the conduction of heat. 
In 1807 the French physicist-mathematician Fourier announced in this con¬ 
nection that an "arbitrary function" f(x) can be represented in the form (5), 
with coefficients given by the formulas 


tt n ~ 

n 


-f/w 

71 J 


cos nxdx and 


b n =~ 


J fix) si 


sinnxdx. 


( 6 ) 


No one believed him, and for the next 15 years he labored at the task of 
accumulating empirical evidence to support his assertion. The results were 
presented in his classic treatise, Theorie Analytique de la Chaleur (1822). Fie 
supplied no proofs, but instead heaped up the evidence of many solved prob¬ 
lems and many convincing specific expansions—so many, indeed, that the 
mathematicians of the time began to spend more effort on proving, rather 
than disproving, his conjecture. The first major result of this shift in the 
winds of opinion was the classical paper of Dirichlet in 1829, in which he 
proved with full mathematical rigor that the series (5) actually does converge 
to the function/(x) for all continuous functions whose graphs consist of a 
finite number of increasing or decreasing pieces—in particular, for the func¬ 
tions illustrated in Figure 53. Thus were Bernoulli and Fourier vindicated. 
We must add, however, that Euler found formulas (6) in 1777, but believed 
them to be valid only in the case of functions/(x) already known to be repre¬ 
sented in the form (5). 

As we know from Chapter 6, in recognition of Fourier's pioneering 
tenacity a trigonometric series of the form (5) is called a Fourier series if 
its coefficients are calculated by formulas (6) from some given integrable 
function/(x). 

Those readers who would like a more detailed description of these memo¬ 
rable events in our intellectual history are urged to consult any (or all) of 
the following masterly accounts: Philip J. Davis and Reuben Flersh, The 
Mathematical Experience, Floughton Mifflin Co., Boston, 1982, pp. 255-270; 
Bela Sz.-Nagy, Introduction to Real Functions and Orthogonal Expansions, Oxford 
University Press, 1965, pp. 375-380; and particularly Bernhard Riemann, in 
A Source Book In Classical Analysis, ed. Garrett Birkhoff, Flarvard University 
Press, 1973, pp. 16-21. 
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In the next section and its problems we present an organized exposi¬ 
tion of the theory of the vibrating string sketched above; and in the sec¬ 
tions after that we turn to other applications of Fourier series in physics and 
mathematics. 

NOTE ON d'ALEMBERT. Jean le Rond d'Alembert (1717-1783) was a French 
physicist, mathematician, and man of letters. In science he is remembered 
for d'Alembert's principle in mechanics and his solution of the wave equation. 
The main work of his life was his collaboration with Diderot in preparing 
the latter's famous Encyclopedic, which played a major role in the French 
Enlightenment by emphasizing science and literature and attacking the 
forces of reaction in church and state. D'Alembert was a valued friend of 
Euler, Lagrange, and Laplace. 


40 Eigenvalues, Eigenfunctions, and the Vibrating String 

We begin by seeking a nontrivial solution y(x) of the equation 


y"+ty =o (i) 

that satisfies the boundary conditions 

y{0) = 0 and y(jt) = 0. (2) 

The parameter X in (1) is free to assume any real value whatever, and part 
of our task is to discover the X's for which the problem can be solved. In our 
previous work we have considered only initial value problems, in which the 
solution of a second order equation is sought that satisfies two conditions at 
a single value of the independent variable. Here we have an entirely different 
situation, for we wish to satisfy one condition at each of two distinct values 
of x. Problems of this kind are called boundary value problems, and in general 
they are more difficult and far-reaching—in both theory and practice—than 
initial value problems. 

In the problem posed by (1) and (2), however, there are no difficulties. If X 
is negative, then Theorem 24-B tells us that only the trivial solution of (1) can 
satisfy (2); and if X = 0, then the general solution of (1) is y(x) = c,x + c 2 , and we 
have the same conclusion. We are thus restricted to the case in which X is 
positive, where the general solution of (1) is 


y(x) = Ci sin a JXx + c 2 cos fXx; 
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and since y( 0) must be 0, this reduces to 

y(x) = Ci sin -JXx. (3) 

Thus, if our problem has a solution, it must be of the form (3). For the second 
boundary condition y(jt) = 0 to be satisfied, it is clear that \fXn must equal m 
for some positive integer n, so X=n 2 . In other words, X must equal one of the 
numbers 1,4,9,.... These values of X are called the eigenvalues of the problem, 
and corresponding solutions 

sin x, sin 2x, sin 3x, ... (4) 

are called eigenfunctions. It is clear that the eigenvalues are uniquely deter¬ 
mined by the problem, but that the eigenfunctions are not; for any nonzero 
constant multiples of (4), say a 1 sin x, a 2 sin 2x, a 3 sin 3x, ..., will serve just 
as well and are also eigenfunctions. For future reference we notice two 
facts: the eigenvalues form an increasing sequence of positive numbers that 
approaches and the nth eigenfunction, sin nx, vanishes at the endpoints of 
the interval [0, ji] and has exactly n - 1 zeros inside this interval. 

We now examine the classical problem of mathematical physics described 
in the preceding section—that of the vibrating string. Our purpose is to 
understand how eigenvalues and eigenfunctions arise. Suppose that a flex¬ 
ible string is pulled taut on the x-axis and fastened at two points that for 
convenience we take to be x = 0 and x = n. The string is then drawn aside into 
a certain curve y=f(x) in the xy-plane (Figure 54) and released. In order to 
obtain the equation of motion, we make several simplifying assumptions, 
the first of which is that the subsequent vibration is entirely transverse. This 
means that each point of the string has constant x-coordinate, so that its 
y-coordinate depends only on x and the time t. Accordingly, the displace¬ 
ment of the string from its equilibrium position is given by some function 
y = y(x, f), and the time derivatives dy/dt and d 2 y/dt 2 represent the string's 
velocity and acceleration. We consider the motion of a small piece which in 
its equilibrium position has length Ax. If the linear mass density of the string 



FIGURE 54 
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is m = m(x), so that the mass of the piece is m Ax, then by Newton's second law 
of motion the transverse force F acting on it is given by 

F = mAx—(5) 
dt 2 v 

Since the string is flexible, the tension T = T(x) at any point is directed along 
the tangent (see Figure 54) and has T sin 0 as its [/-component. We next 
assume that the motion of the string is due solely to the tension in it. As a 
consequence, F is the difference between the values of T sin 0 at the ends of 
our piece, namely A(T sin 0), so (5) becomes 


A(T sin 0) = m Ax —\. 
v ' Dt 2 


( 6 ) 


If the vibrations are relatively small, so that 0 is small and sin 0 is approxi¬ 
mately equal to tan Q = dy/dx, then (6) yields 


A (T dy/dx) _ ^ d 2 y . 


Ax 


dt 2 


and when Ax is allowed to approach 0, we obtain 




— T — = m 


dx 1 3x 


dt 2 


( 7 ) 


Our present interest in this equation is confined to the case in which both m 
and T are constant, so that the equation can be written 


2 d 2 y d 2 y 

a —f = —f 

dx 2 dt 2 


( 8 ) 


with a = Jr/m. For reasons that will emerge in the Problems, equation (8) is 
called the one-dimensional wave equation. We seek a solution y(x,t ) that satisfies 
the boundary conditions 


y(0,f) = 0 


( 9 ) 


and 


y(n,t)-0. 


( 10 ) 
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and the initial conditions 


% 

-t=o 


= 0 


and 


y(x,Q) =/(x). 


( 11 ) 


( 12 ) 


Conditions (9) and (10) express the assumption that the ends of the string are 
permanently fixed at the points x = 0 and x-it and (11) and (12) assert that 
the string is motionless when it is released and that y-f(x ) is its shape at that 
moment. We note explicitly, however, that none of these conditions are in any 
way connected with the derivation of (7) and (8). 

We shall give a formal solution of (8) by the method of separation of vari¬ 
ables. This amounts to looking for solutions of the form 

y(x,t) = u(x)v(t), (13) 

which are factorable into a product of functions each of which depends 
on only one of the independent variables. When (13) is substituted into (8), 
we get 


a 2 u"(x)v(t) = u(x)v"(t) 


or 


u\x)_ 1 v\t) 
u(x) a 2 v(t) 

Since the left side is a function only of x and the right side is a function only 
of t, equation (14) can hold only if both sides are constant. If we denote this 
constant by -X, then (14) splits into two ordinary differential equations for 
u(x) and v(t): 


u" +Xu-0 (15) 

and 

v" +Xa 2 v-Q. (16) 

It is possible to satisfy (9) and (10) by solving (15) with the boundary condi¬ 
tions u(0) = 11 ( 71 ) = 0. We have already seen that this problem has a nontrivial 
solution if and only if X-n 2 for some positive integer n, and that correspond¬ 
ing solutions (the eigenfunctions) are 


u„(x) - sin nx. 
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Similarly, for these A,'s (the eigenvalues) the general solution of (16) is 

v(t) = Cj sin nat + c 2 cos nat; 

and if we impose the requirement that v'(0) = 0, so that (11) is satisfied, then 
c 1 - 0 and we have solutions 


v„(t) = cos nat. 

The corresponding products of the form (13) are therefore 

y n (x, t) = sin nx cos nat. 

Each of these functions, for n = 1, 2, ..., satisfies equation (8) and conditions 
(9), (10), and (11); and it is easily verified that the same is true for any finite 
sum of constant multiples of the y n : 

b 1 sin x cos at + b 2 sin lx cos lat +••■ + &„ sin nx cos nat. 

If we proceed formally—that is, ignoring all questions of convergence, term- 
by-term differentiability, and the like—then any infinite series of the form 


y(x,t) = b„ sin nx cos nat = b^ sin x cos 


at 


+ b 2 sin 2x cos lat + --- + b„ sin nx cos nat+ ■•■ 


(17) 


is also a solution that satisfies (9), (10), and (11). This brings us to the final 
condition (12), namely, that for f = 0 our solution (17) should yield the initial 
shape of the string: 

f(x) = b 1 sin x + b 2 sin 2x + ■ ■ ■ +b n sin nx + ■■■. (18) 

As we said in the preceding section, when these formulas were developed 
by Daniel Bernoulli in 1753, it seemed to many mathematicians that (18) 
ought to be impossible unless/(x) were a function of some very special type. 
During the next century it became clear that this opinion was mistaken, and 
that in reality expressions of the form (18) are valid for very wide classes of 
functions/(x) that vanish at 0 and k. Assuming that this is true, the problem 
remained for Bernoulli and his contemporaries of finding the coefficients b n 
when the function/(x) is given. This problem was solved by Euler in 1777, 
and his solution launched the vast subject of Fourier series. We know how 
to find these coefficients from our work in Section 35, but we shall find them 
again by methods that fit into a broader pattern of ideas. 
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The eigenfunctions u m {x) and u u (x), that is, sin mx and sin nx, satisfy the 
equations 


u" m = -m 2 u m and u"„ = -n 2 u„. 

If the first equation is multiplied by u n and the second by u m , then the differ¬ 
ence of the resulting equations is 

u„u" m -u m u " ={n 2 -m 2 )u m u n 


or 

(u n u’ m - u m u' n y = ( n 2 - rn 2 )u m u„. (19) 

On integrating both sides of (19) from 0 to it and using the fact that 
u m (x) = sin mx and u n (x) - sin nx both vanish at 0 and it, we obtain 


7t 

’>1 


(n 2 - m 2 ) u m (x)u n (x)dx = [u„(x)u' m (x)- u m (x)u' n (x)]o = 0, 


so 


n 

1 


sin mx sin nx dx = 0 when m^n. 


( 20 ) 


This result suggests multiplying (18) through by sin nx and integrating the 
result term by term from 0 to it. When these operations are carried out, (20) 
produces a wholesale disappearance of terms, leaving only 

K K 

\f(x) sin nx dx = h n J* sin 2 nx dx-, 

o o 


and since 


we have 


7C n 

J" sin 2 nx dx = — J* (1 - cos 2 nx) dx - 


it 

2 ' 


n 



sin nxdx. 


o 


b, 


( 21 ) 
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These b„ are very familiar to us and are called the Fourier coefficients of f(x). 
With these coefficients, (18) is the Fourier sine series of f(x) or the eigenfunc¬ 
tion expansion of f(x) in terms of the eigenfunctions sin nx, and (17) is called 
Bernoulli's solution of the wave equation. 

The above "solution" of the wave equation is clearly riddled with doubt¬ 
ful procedures and unanswered questions, so much so, indeed, that from 
a strictly rigorous point of view it cannot be regarded as having more than 
a suggestive value. But even this much is well worth the effort, for some of 
the questions that arise—especially those about the meaning and validity of 
(18)—are exceedingly fruitful. For instance, if the b n are computed by means 
of (21) and used to form the series on the right of (18), under what circum¬ 
stances will this series converge? And if it converges at a point x, does it nec¬ 
essarily converge to/(x)? We give the following brief statement of one answer 
to these questions that is fully covered by the theorem proved in Appendix 
A at the end of the preceding chapter. 

The function/(x) under consideration is defined on the interval [0 ,ji] and 
vanishes at the endpoints. Suppose that f(x) is continuous on the entire inter¬ 
val, and also that its derivative is continuous with the possible exception of 
a finite number of jump discontinuities, where the derivative approaches finite 
but different limits from the left and from the right. In geometric language, 
the graph of such a function is a continuous curve with the property that the 
direction of the tangent changes continuously as it moves along the curve, 
except possibly at a finite number of "corners" where its direction changes 
abruptly. Under these hypotheses the expansion (18) is valid; that is, if the b n 
are defined by (21), then the series on the right converges at every point to 
the value of the function at that point. The need for a carefully constructed 
theory can be seen from the fact that if f(x) is merely assumed to be continu¬ 
ous, and nothing is said about its derivative, then it is known to be possible 
for the series on the right of (18) to diverge at some points. 2 

Another line of investigation considers the possiblity of eigenfunction 
expansions like (18) for other boundary value problems. If we put aside the 
issue of the validity of such expansions, then the main problem becomes that 
of showing in other cases that we have an adequate supply of suitable build¬ 
ing materials, i.e., a sequence of eigenvalues with corresponding eigenfunc¬ 
tions that satisfy some condition similar to (20). 

Suppose, for instance, we consider the vibrating string studied above 
with one significant difference: the string is nonhomogeneous, in the sense 
that its density m = mix) may vary from point to point. In this situation, (8) is 
replaced by 


d 2 y _ m(x) d 2 y 
d? “ T dt 2 ' 


2 It has been known since 1966 that there even exists a continuous function whose Fourier 
series diverges at every rational point in [0,ic]. 
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If we again seek a solution of the form (13), then (22) becomes 

u"(x) _ 1 v*(t ), 

m(x)u(x) T v(t) ' 

and as before, we are led to the following boundary value problem: 

u" + Xm(x)u = 0, w(0) = u(n) = 0. (23) 

What are the eigenvalues and eigenfunctions in this case? Needless to say, 
we cannot give precise answers without knowing something definite about 
the density function m(x). But at least we can prove that these eigenvalues 
and eigenfunctions exist. The details of this argument are given in Appendix 
A at the end of this chapter. 


Problems 

1. Find the eigenvalues X n and eigenfunctions y„(x) for the equation 
y" + A,y = 0 in each of the following cases: 

(a) y(0) = 0, y(jt/2) = 0; 

(b) y(0) = 0, y(2jt) = 0; 

(c) y(0) = 0, y(l) = 0; 

(d) y(0) = 0, y(L) = 0 when L > 0; 

(e) y(-L) = 0, y(L) = 0 when L > 0; 

(f) y(a) = 0, y(b) = 0 when a<b. 

Solve the following two problems formally, i.e., without considering 
such purely mathematical issues as the differentiability of functions 
and the convergence of series. 

2. If y = F(x) is an arbitrary function, then y=F(x + at) represents a wave 
of fixed shape that moves to the left along the x-axis with velocity a 
(Figure 55). Similarly, if y = G(x) is another arbitrary function, then 
y = G(x - at) is a wave moving to the right, and the most general one¬ 
dimensional wave with velocity a is 

y(x,t) = F(x + at) + G(x - at). (*) 

(a) Show that (*) satisfies the wave equation (8). 

(b) It is easy to see that the constant a in equation (8) has the dimen¬ 
sions of velocity. Also, it is intuitively clear that if a stretched string 
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y 


at 



y = F(x + at) 



y = F(x) 


x 


FIGURE 55 


is disturbed, then waves will move in both directions away from 
the source of the disturbance. These considerations suggest intro¬ 
ducing the new variables a-x + at and |S = x - at. Show that with 
these independent variables, equation (8) becomes 


5a dp 


and from this derive (*) by integration. Formula (*) is called 
d'Alembert's solution of the wave equation. It was also obtained by 
Euler, independently of d'Alembert but slightly later. 

3. Consider an infinite string stretched taut on the x-axis from to °°. Let 
the string be drawn aside into a curve y =/(x) and released, and assume 
that its subsequent motion is described by the wave equation (8). 

(a) Use (*) to show that the string's displacement is given by d'Alembert's 
formula, 



n 


Hint: Remember the initial conditions (11) and (12). 

(b) Assume further that the string remains motionless at the points 
x = 0 and x = n (such points are called nodes), so that y(0,f) = y(n,t) = 0, 
and use (**) to show that f(x) is an odd function that is periodic with 
period 2it [that is,/(-x) = -/(x) and f(x + 2n) =/(x)]. 

(c) Show that since/(x) is odd and periodic with period 2jt, it necessar¬ 
ily vanishes at 0 and it. 

(d) Show that Bernoulli's solution (17) can be written in the form of (**). 
Hint: 1 sin nx cos nat-sin [«(x + flt)] + sin [n(x - at)]. 

4. Consider a uniform flexible chain of constant mass density m 0 hang¬ 
ing freely from one end. If a coordinate system is established as in 
Figure 56, then the lateral vibrations of the chain, when it is disturbed. 
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FIGURE 56 


are governed by equation (7). In this case, the tension T at any point 
is the weight of the chain below that point, and is therefore given by 
T = m 0 xg, where g is the acceleration due to gravity When m 0 is can¬ 
celed, (7) becomes 


d_ 

dx 




d 2 y 

dt 2 ' 


(a) Assume that this partial differential equation has a solution of the 
form y(x,t)-u(x)v(t), and show as a consequence that u(x) satisfies 
the following ordinary differential equation: 


d_ 

dx 


gx 


du 

dx 


+ A ,u = 0. 




(b) If the independent variable is changed from x to z = 2 yjXx/g, show 
that equation (***) becomes 
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z 



du 

dz 


+ zu = 0, 


which (apart from notation) is Bessel's equation l-(9) for the special 
case in which p - 0 


5. Solve the vibrating string problem in the text if the initial shape (12) is 
given by the function 

[ lexjn, 0 < x < n/2, 

[2c(7i-x)/7i, n/2<x<n; 


(a) /(x) = 


(b) /(x) = —x(7i-x); 
n 


(c) /(x) = - 


X, 

7l/4, 

n-x, 


0 < x < jt/4, 

7i/4 < x < 3 tt/4 , 
3tc/4 <x<n. 


In each case, sketch the initial shape of the string. 

6. Solve the vibrating string problem in the text if the initial shape (12) is 
that of a single arch of a sine curve, /(x) = c sin x. Show that the mov¬ 
ing string always has the same general shape. Do the same for func¬ 
tions of the form/(x) = c sin nx. Show, in particular, that there are n - 1 
points between x = 0 and x = it at which the string remains motionless; 
these points are called nodes, and these solutions are called stand¬ 
ing waves. Draw sketches to illustrate the movement of the standing 
waves. 


7. The problem of the struck string is that of solving equation (8) with the 
boundary conditions (9) and (10) and the initial conditions 


% 

- 1 =o 


= g(x) 


and 


y(x,0)= 0. 


(These initial conditions mean that the string is initially in the equilib¬ 
rium position, and has an initial velocity g(x) at the point x as a result of 
being struck.) By separating variables and proceeding formally, obtain 
the solution 


y(x,f) = ^c„ 

i 


sin nx sin nat 


where 


2 

nna 


K 

jVw 


sin nxdx. 


c, 
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41 The Heat Equation 

When we study the flow of heat in thermally conducting bodies, we encoun¬ 
ter an entirely different type of problem leading to a partial differential 
equation. 

In the interior of a body where heat is flowing from one region to another, 
the temperature generally varies from point to point at any one time, and 
from time to time at any one point. Thus, the temperature zv is a function 
of the space coordinates x, y, z and the time f, say w - w(x,y,z,t)- The precise 
form of this function naturally depends on the shape of the body, the ther¬ 
mal characteristics of its material, the initial distribution of temperature, and 
the conditions maintained on the surface of the body. The French physicist- 
mathematician Fourier studied this problem in his classic treatise of 1822, 
Theorie Analytique de la Chaleur. He used physical principles to show that the 
temperature function w must satisfy the heat equation 



( 1 ) 


^ dx 2 dy 2 5z 2 J dt 


We shall retrace his reasoning in a simple one-dimensional situation, and 
thereby derive the one-dimensional heat equation. 

The following are the physical principles that will be needed: 

(a) Heat flows in the direction of decreasing temperature, that is, from 
hot regions to cold regions. 

(b) The rate at which heat flows across an area is proportional to the 
area and to the rate of change of temperature with respect to dis¬ 
tance in a direction perpendicular to the area. (This proportional¬ 
ity factor is denoted by k and called the thermal conductivity of the 
substance.) 

(c) The quantity of heat gained or lost by a body when its temperature 
changes, that is, the change in its thermal energy, is proportional to 
the mass of the body and to the change of temperature. (This pro¬ 
portionality factor is denoted by c and called the specific heat of the 
substance.) 

We now consider the flow of heat in a thin cylindrical rod of cross-sectional 
area A (Figure 57) whose lateral surface is perfectly insulated so that no heat 
flows through it. This use of the word "thin" means that the temperature is 
assumed to be uniform on any cross section, and is therefore a function only 
of the time and the position of the cross section, say w - w(x,t). We examine 
the rate of change of the heat contained in a thin slice of the rod between the 
positions x and x + Ax. 
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FIGURE 57 

If p is the density of the rod, that is, its mass per unit volume, then the mass 
of the slice is 


A m = p A Ax. 

Furthermore, if Azc is the temperature change at the point x in a small time 
interval At, then (c) tells us that the quantity of heat stored in the slice in this 
time interval is 


AH = c Am Aw = cpA Ax Aw, 


so the rate at which heat is being stored is approximately 


AH 

At 


= cpA Ax 


Aw 
At ' 


( 2 ) 


We assume that no heat is generated inside the slice—for instance, by 
chemical or electrical processes—so that the slice gains heat only by means 
of the flow of heat through its faces. By (b) the rate at which heat flows into 
the slice through the left face is 


-kA 


dw 

dx 


The negative sign here is chosen in accordance with (a), so that this quantity 
will be positive if dw/dx is negative. Similarly, the rate at which heat flows 
into the slice through the right face is 


, . dw 
kA — 
dx 


so the total rate at which heat flows into the slice is 


kA 


dw 

dx 


-kA 


dw 

dx 


x+Ax 


( 3 ) 
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If we equate the expressions (2) and (3), the result is 


kA 


dw 

dx 


-kA 


dw 

dx 


= cpA Ax 


Aw 
At' 


or 


k 

dw dx\ -dw/dx\ 

' \x+Ax ' \x 

cp 

Ax 


Aw 
At ' 


Finally, by letting Ax and At —>• 0 we obtain the desired equation, 

, d 2 w dw 
a~ —r- = —, 

5x 2 dt 


(4) 


where a 2 -k/cp. This is the physical reasoning that leads to the one¬ 
dimensional heat equation. The three-dimensional equation (1) can be 
derived in essentially the same way. 

We now solve the one-dimensional heat equation (4), subject to the fol¬ 
lowing set of conditions: the rod is n units long and lies along the x-axis 
between x = 0 and x = jr; the initial temperature is a prescribed function/(x), 
so that 


ic(x,0)=/(x); (5) 

and the ends of the rod have the constant temperature zero for all values 
of t > 0, 


w(0,t) = 0 and ic(jt,f) = 0. (6) 

We try for a solution of this boundary value problem by the method of sepa¬ 
ration of variables that worked so well in the case of the wave equation; that 
is, we seek a solution of (4) having the form 

w(x,t) = u(x)v(t). (7) 

When this expression is substituted in (4), the result can be written 

u"(x ) _ 1 v'(t) 
u(x) a 2 v(t) 


( 8 ) 
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Since each side of this equation depends on only one of the variables, both 
sides must be constant, and if we denote this common constant value by -X, 
then (8) splits into the two ordinary differential equations 

u"+Xu = 0 (9) 

and 

v' + Xa 2 v = 0. (10) 

Just as in Section 40, we solve (9) and satisfy the boundary conditions (6) by 
setting X = n 2 for any positive integer n, and the corresponding eigenfunc¬ 
tion is 


u„(x) = sin nx. 

With this value of X, equation (10) becomes 


v' + n 2 a 2 v = 0, 


which has the easy solution 


v„{t) = e 1 


The resulting products of the form (7) are therefore 

w n (x,t) = e~ n " ‘sinnx, n= 1,2,3,.... (11) 

This brings us to the point where we know that each of the functions (11) 
satisfies equation (4) and the boundary conditions (6), and it is clear that the 
same is true for any finite linear combination of the zv n : 

b ^“ f sin x + b 2 e^ ia f sin 2x + —f- b n e~ n “ 1 sin nx. (12) 

Without dwelling on the important mathematical issues of convergence and 
term-by-term differentiability, we now pass from (12) to the corresponding 
infinite series, 

oo 

w(x,t) = y [ b n e~ n2c,2t 

n =1 


sm nx. 


(13) 


370 


Differential Equations with Applications and Historical Notes 


This will be a solution of our original boundary value problem if it allows us 
to satisfy the initial condition (5), that is, if (13) reduces to the initial tempera¬ 
ture distribution/^) when t = 0: 


oo 

/« = sin nx. 

n=\ 


(14) 


To finish this part of our work and make the solution (13) completely explicit, 
all that remains is to determine the b„ as the Fourier coefficients in the expan¬ 
sion (14) of f(x) in a Fourier sine series. 



sin nxdx. 


(15) 


Example 1. Suppose that the thin rod discussed above is first immersed 
in boiling water so that its temperature is 100°C throughout, and then 
removed from the water at time t = 0 with its ends immediately put in 
ice so that these ends are kept at temperature 0°C. Find the temperature 
iv = w{x,t) under these circumstances. 

Solution. This is the special case of the above discussion in which the 
initial temperature distribution is given by the constant function 

/(x) = 100, 0<x<jt. 

We must therefore find the sine series of this function, which we can 
either calculate from scratch by using (15) or obtain in some other way 
(see Problem 35-4), 


m 


400 f . sin3x sin5x 
it (, 3 5 


By referring to formula (13), we now see that the desired temperature 
function is 


w(x,t ) 


400 

n 


e " 2f sinx + 



sin3x + 


ig-25.3 

5 


sin5x + --- 


Example 2. Find the steady-state temperature of the thin rod discussed 
above if the fixed temperatures at the ends x=0 and x = jt are w 1 and w 2 , 
respectively. 
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Solution. "Steady-state" means that dzv/dt=0, so the heat equation (4) 
reduces to d * 1 2 3 w/dx 2 =0 or d 2 w/dx 2 = 0. The general solution is therefore 
w = CjX + c 2/ and by using the boundary conditions we easily determine 
these constants of integration and obtain the desired solution, 

W = W\ + — (w 2 - Wi)X. 
n 


The steady-state version of the three-dimensional heat equation (1) is 

d 2 w d 2 w d 2 w . 

— v + — v + — v = 0 ; 

dx 2 dy 2 dz 2 


(16) 


it is called Laplace's equation. The study of this equation and its solutions 
and uses—there are many applications in the theory of gravitation—is a 
rich branch of mathematics called potential theory. This topic is continued in 
Appendix A at the end of the next chapter. The corresponding equation in 
two dimensions is 


d 2 w d 2 zv 
dx 2 dy 2 


= 0 ; 


(17) 


this is a valuable tool if plane problems are under consideration. Equation 
(17) also has a special significance of its own in complex analysis. 


Problems 

1. Derive the three-dimensional heat equation (1) by adapting the reason¬ 
ing in the text to the case of a small box with edges Ax, Ay, Az contained 
in a region R in xyz-space where the temperature function w(x,y,z,t) is 
sought. Hint: Consider the flow of heat through two opposite faces of 
the box, first perpendicular to the x-axis, then the y-axis, and finally the 
z-axis. 

2. Solve the boundary value problem in the text if the conditions are 
altered from (5) and (6) to 

iv (x,0) =f(x) and w(0,t) = w v zv(n,t) = w 2 . 

Hint: Write w{x,t) = W(x,t) +g(x) and remember Example 2. 

3. Suppose that the lateral surface of the thin rod in the text is not insu¬ 
lated, but instead radiates heat into the surroundings. If Newton's 
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law of cooling applies, show that the one-dimensional heat equation 
becomes 


a 


2 


d 2 zv 

dx 2 


8w , . 

— + c(w-w 0 ), 
at 


where c is a positive constant and w 0 is the temperature of the 
surroundings. 

4. In the preceding problem, find w(x,t) if the ends of the rod are kept at 
0°C, zv 0 -0°C, and the initial temperature distribution is f(x). 

5. In Example 1, suppose the ends of the rod are insulated instead of being 
kept at 0°C. What are the new boundary conditions? Find the tempera¬ 
ture zv(x,t) in this case by using only common sense. 

6. Solve the problem of finding w(x,t) for the rod with insulated ends at 
x = 0 and x = n (see the preceding problem) if the initial temperature dis¬ 
tribution is given by w(x, 0) =f(x). 

7. The two-dimensional heat equation is 

2 f d 2 w dw 

v dx 2 dy J dt 


Use the method of separation of variables to find a steady-state solu¬ 
tion of this equation in the infinite strip of the xy-plane bounded 
by the lines x = 0, x = n and y- 0 if the following conditions are 
satisfied: 


zf(0,y) = 0, w(n,y) = Q, 

w(x,0) = f(x), limic(x,i/) = 0. 

y->® 


42 The Dirichlet Problem for a Circle. Poisson's Integral 

We continue our overall program in this chapter of acquainting the stu¬ 
dent with important mathematical problems related to both partial dif¬ 
ferential equations and Fourier series. Even though we cannot treat these 
problems in the depth they deserve within the limitations of the present 
book, at least it is possible to convey an impression of what these problems 
are and briefly describe some of the standard methods for dealing with 
them. 
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We begin with the two-dimensional Laplace equation mentioned at the 
end of Section 41. In rectangular coordinates ( x,y) it is 


d 2 w d 2 zv 
—t = 0 ; 
dx" dy 2 


( 1 ) 


and in polar coordinates (r, 0) it is 


d 2 w 1 dw 1 d 2 zv 
dr 2 r dr r 2 30 2 


( 2 ) 


It is an exercise in the use of the chain rule for partial derivatives to trans¬ 
form these equations into one another (see Problem 1 below). Many types of 
physical problems require solutions of Laplace's equation, and there exists 
a wide variety of solutions containing many different kinds of functions. 
However, just as in the preceding sections, a specific physical problem usu¬ 
ally asks for a solution that is defined in a certain region and satisfies a given 
condition on the boundary of that region. 

There is a famous problem in analysis called the Dirichlet problem, one ver¬ 
sion of which can be stated as follows: Given a region R in the plane bounded 
by a simple closed curve C, and given a function/(P) defined and continuous 
for points P on C, it is required to find a function zv(P) continuous in R and 
on C, such that zv(P) satisfies Laplace's equation in R and equals f(P) on the 
boundary C. 

We shall consider the special case in which R is the interior of the unit 
circle x 2 + y 2 - 1, and we use polar coordinates as the geometry suggests. Let 
w = w(r, 0) be a function continuous inside and on this circle. The values of 
this function when r- 1 are called its boundary values for the circular region. 
The function a;(l,0) is evidently a continuous function of 0 with period 2jt. 
The Dirichlet problem for this circular region is then the following: Let/(0) 
be any given continuous function of 0 with period 2n. It is required to find a 
function iv = w(r, 0) that satisfies Laplace's equation (2) for 0 < r < 1, and has the 
further property that a;(l,0) =/(0) for each value of 0. In some versions of the 
Dirichlet problem the condition that/(0) must be continuous is relaxed and 
the condition iv( 1,0) =/(0) is expressed in a different form; we shall comment 
further on these matters below. 

If w is understood to be temperature, then we know from our work in 
Section 41 that the Dirichlet problem for a circle is the problem of finding the 
steady-state temperature throughout a thin circular plate when the tempera¬ 
ture along the edge is prescribed in advance. Solutions of Laplace's equa¬ 
tion are often called harmonic functions. Using this language, the Dirichlet 
problem is the problem of finding a function that is harmonic in the circular 
region and assumes preassigned continuous values on the boundary. 
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Now for the details of solving this problem. We begin by ignoring the 
boundary function/(0) and seeking solutions of Laplace's equation (2) that 
have the form w = w(r,Q) = u(r)v(Q), that is, that can be written as the product 
of a function of r alone and a function of 0 alone. Thus, we make yet another 
application of the method of separation of variables. When this function is 
substituted in equation (2) we obtain 

u"(r)v(Q) + - u'(r)v(Q) + \ u(r)v"(Q) = 0 
r r 


or 


r 2 u"{r)+ru\r) _ i/'(0) 
u(r) v(8) 


(3) 


The left side of (3) is independent of 0, and the right side is independent of r, 
so both sides must be constant; and if we denote this common constant value 
by X, then (3) splits into the two equations 


v" + Xv = 0 (4) 

and 

r 2 u" + ru'-Xu = 0. (5) 

We want z;(0) to be continuous and periodic with period 2it—and, of course, 
not identically zero. This requires us to conclude that the constant X in (4) 
must be of the form X-n 2 with n = 0, 1, 2, 3, ... For n = 0 the only suitable 
solution is v = a constant, and for n = 1, 2, 3, ... the solutions of (4) are linear 
combinations of cos n0 and sin n0, 

v„(8) = a n cos nQ + b n sin nQ. 

We next set X = n 2 in equation (5), which then becomes 


r —y + r - n u = 0. 

dr dr 

This is Euler's equidimensional equation (Problem 17-5), with solutions 

u(r) = A + B log r if n = 0, 

u(r) = Ar n +Br n if n = 1,2,3,..., 
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where A and B are constants. We want u(r) to be continuous at r- 0, so we 
take B = 0 in all cases, and we therefore have 


u»(r) = r n . 

If we now write down all the solutions w = u n (r)v n (Q) in sequential order, the 
result is as follows: 


n = 0, 

1 

vo = a constant — tz 0 ; 

2 

77 = 1, 

w = r{a x cos 0 + bi sin 0); 

77 = 2, 

w = r 2 (a 2 cos 20+ £> 2 sin 20); 

77 = 3, 

w = r 3 (a 3 cos 30 + b 3 sin 30); 


It is easy to see that any finite sum of solutions of Laplace's equation is also 
a solution, and the same is true for an infinite series of solutions if the series 
has suitable convergence properties. This leads us to the solution 


w = w(r, 0) = 


1 

-«o + 


00 

cos n0 + b„ sin770). 

n =1 


( 6 ) 


If we put r- 1 in (6) and remember that we want to satisfy the boundary con¬ 
dition 777(1,0) =/ (0), then we obtain 


/( 0 ) - ^ a o + 


00 

„ COS 77 0 + b n sin 77 0). 

n =1 


( 7 ) 


It is now clear what must be done to solve the Dirichlet problem for the 
unit circle: start with the given boundary function/(0) and find its Fourier 
series (7); then form the solution (6) by merely inserting the factor r” in front 
of the expression in parentheses in (7). Of course, the constant term in (6) 

is written as — a 0 for the sake of agreement with the standard notation for 
Fourier series. 


Example. Solve the Dirichlet problem for the unit circle if/ (0) = 1 on the 
top half of the circle (O<0<jt) and/(0) = -l on the bottom half of the circle 
(-ji < 0 < 0), with/ (0) =/ (±jt) = 0. 

Solution. We know from Problem 35-4 that the Fourier series for/(0) is 


m 


4( . sin30 sin50 ) 
—I sin0 +-+-+ ■•• I. 


71 


3 


5 
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The solution of the Dirichlet problem is therefore 


zv(r,Q) 


1 1 
rsin0 + —r 3 * sin30 + —r 5 sin50 + --- I. 


The discussion given above is concerned mostly with formal procedures and 
not with delicate questions of convergence. However, we state without proof 
that if the a n and b n are the Fourier coefficient's of/(0), then the series (6) con¬ 
verges for 0 < r < 1 and its sum w(r,Q) is a solution of Laplace's equation in this 
region. For this to be true it is not necessary to assume that/(0) is continuous, 
or even that its Fourier series converges. It is enough to assume that / (0) is 
integrable. Furthermore, even with this weak hypothesis it turns out that/(0) 
is the boundary value of ic(r,0), in the sense that 

limze(r,0) = /(0) 


at every point of continuity of the function/(0). These remarkable facts have 
emerged from careful theoretical studies of the Poisson integral, which we 
now briefly describe. 3 


The Poisson integral. The Dirichlet problem for the unit circle is now solved, 
at least formally. However, a simpler expression for this solution can be found 
as follows, if we don't mind a bit of calculating with complex numbers. As we 
know, the coefficients in (6) are given by the formulas 


Un ~ 


n k 

— f(§) cos m j> d( j), b n = — /(())) sin n§ d§. 

k J n J 


When these are substituted in (6), then by using the identity 
cos (0 — tf>) = cos 0 cos 4> + sin 0 sin 4> 

and interchanging the order of integration and summation, we obtain 


if 1 J”, 

zv(r,Q) = — [/(<)>) — + y'V" cos«(0-())) 
n J 2 


( 8 ) 


3 More details on these interesting matters of theory can be found in H. S. Carslaw, Introduction 

to the Theory of Fourier's Series and Integrals, 3d ed., Macmillan, London, 1930, pp. 250-254; R. T. 

Seeley, An Introduction to Fourier Series and Integrals, W A Benjamin, New York, 1966, pp. 16-19; 
or pp. 436-442 of the book of Sz.-Nagy mentioned in Section 39. 
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To sum the series in brackets, we put a = 0 - 4> and let z = re™ = r(cos a + i sin a). 
Then z n = r n e ina = r n (cos na + i sin na) and 


1 °° 1 °° 

— + ^"V" cos na = real part — + ^z" 


= real part 

1 1 
— + 

2 1- 

= real part 

1 + z 

L 2(1— z)_ 

= real part 

(l + z)(l 

21-2 


1- | z | 2 _ 1-r 2 

2 11 — z | 2 2(l-2rcosa + r 2 ) 


By substituting this in (8) we obtain 


w (r, 9) = — 

2ji 


71 

>71 J 1- 


2rcos(0-())) + r' 


-/(<t>)rf<l>- 


( 9 ) 


This remarkable formula for the solution of the Dirichlet problem is called 
the Poisson integral; it expresses the value of the harmonic function zv(r,Q) at all 
points inside the circle in terms of its values on the circumference of the circle. 
It should also be observed that for r = 0 formula (9) yields 




This shows that the value of the harmonic function w at the center of the 
circle is the average of its values on the circumference. 

NOTE ON POISSON. Simeon Denis Poisson (1781-1840), a very eminent 
French mathematician and physicist, succeeded Fourier in 1806 as full pro¬ 
fessor at the Ecole Polytechnique. In physics, Poisson's equation describes 
the variation of potential inside continuous distributions of mass or electric 
charge, just as Laplace's equation does in empty space. He also made impor¬ 
tant theoretical contributions to the study of elasticity, magnetism, heat, and 
capillary action. In pure mathematics, the Poisson summation formula is a 
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major tool in analytic number theory, and the Poisson integral pointed the 
way to many important developments in Fourier analysis. In addition, he 
worked extensively in probability. It was he who named the law of large 
numbers; and the Poisson distribution—or law of small numbers—has many 
applications to such phenomena as the distribution of blood cells on a micro¬ 
scope slide, of automobiles on a highway, of customers at a theater ticket 
office, etc. According to Abel, Poisson was a short, plump man. His family 
tried to encourage him in many directions, from being a doctor to being a 
lawyer, this last on the theory that perhaps he was fit for nothing better, but 
at last he found his niche as a scientist and produced over 300 works in a 
relatively short lifetime. "La vie, c'est le travail (Life is work)," he said—and 
he had good reason to know. 


Problems 

1. If if = F(x,y) = G(r,0) with x-r cos 0 and y = r sin 0, show that 


Hint: 


d 2 w d 2 w _ 1 
dx 2 '" ,2 


dy 


d ( dw \ 1 d 2 w 
3rl dr ) r 30 2 


d 2 w 1 dw 1 d 2 w 
dr 2 r dr r 2 30 2 


dw dw „ dw . „ . 

— = —cos0 + —sm0 and 
dr dx dy 


dzv dzv, . div , 

— = —(-rsm0) + — (rcos0). 
30 dx v dy 


d ( __dw\ , d 2 w 


Similarly, compute — •—J and 

2. Solve the Dirichlet problem for the unit circle if the boundary function 
/(0) is defined by 
1 

(a) /(0) = cos — 0, -n < 0 < ji; 

(b) /(0) = O, -jt<0<jr; 

(c) f(Q) = 0 for —jr < 0<O, f(ff) = sin 0 for 0 < 0 < 7t; 

(d) /(0) = 0 for -jt < 0 < 0, /(0) = 1 for 0 < 0 < tt; 

(e) f = 0 2 , -7t < 0 < jr. 
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3. Show that the Dirichlet problem for the circle x 2 + y 2 -R 2 , where/(0) is 
the boundary function, has the solution 



where a n and b n are the Fourier coefficients of/(0). Show also that the 
Poisson integral for this more general case is 


K 


J_ r R 2 -r 2 

2nJ R 2 - 2R2 cos(0 -()>) + r 


w(r,Q) = 2tt I 


r:- if 


-n 


4. Let w(P) be harmonic in a plane region, and let C be any circle entirely 
contained in this region. Prove that the value of w at the center of C is 
the average of its values on the circumference. (This is a major theorem 
of potential theory due to Gauss.) 


43 Sturm-Liouville Problems 

We return briefly to the discussion of eigenvalues and eigenfunctions at 
the beginning of Section 40. Our purpose here is to place these ideas in a 
broader context that will help make an easier transition to the topics of the 
next chapter. 

As we know, a sequence of functions y„(x) with the property that 



( 1 ) 


is said to be orthogonal on the interval [a,b\. If cc„ =1 for all n, the functions are 
said to be normalized, and we speak of an orthonormal sequence. A more gen¬ 
eral type of orthogonality is defined by the property 



a 


In this case the sequence is said to be orthogonal with respect to the weight func¬ 
tion q(x). Orthogonality properties of this kind are possessed by the eigen¬ 
functions associated with a wide variety of boundary value problems. 
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Consider a differential equation of the form 


d 

dx 



+ [kq(x)+r(x)]y = 0, 


(3) 


for which we are interested in solutions valid on the interval [a,b]. We know 
from Theorem A in Section 14 that if p(x), p'(x), q(x), and r(x) are continu¬ 
ous on this interval, and if p(x) does not vanish there, then there is one and 
only one solution y(x) for the initial value problem in which we arbitrarily 
assign prescribed values to both y(a) and y'(a). Suppose, however, that we 
wish to assign prescribed values to both y(a) and y(b), that is, to y(x) at two 
different points, rather than to y(x) and y'(x) at the same point. We examine 
the circumstances under which this boundary value problem has a nontrivial 
solution. 


Example 1. At the beginning of Section 40 we considered the special case 
of (3) in which p(x) = q{x) = 1 and r(x) = 0, so that the equation is 

y" + Xy=0. 

The interval was taken to be [0,Jt] and the boundary conditions were 
y(0) = 0 and y(it) = 0. 

We found that for this problem to be solvable X must have one of the 
values 

X n = n 2 , n = 1,2,3, ..., 

and that corresponding solutions are 

y„(x) = sin nx. 


We called the X H the eigenvalues of the problem, and the y n (x) are correspond¬ 
ing eigenfunctions. 

In the case of the more general equation (3), it turns out that if the func¬ 
tions p(x) and q{x) are restricted in a reasonable way—specifically, if p(x) > 0 
and q(x) > 0 on [a,b\ —then we will also be able to obtain nontrivial solutions 
satisfying suitable boundary conditions at the two distinct points a and b if 
and only if the parameter X takes on certain specific values. These are the 
eigenvalues of the boundary value problem; they are real numbers that can be 
arranged in an increasing sequence 


X : <X 2 <X 3 <---<X n <X n+ T<---, 


( 4 ) 
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and furthermore. 


X n —> °° as n -* 

This ordering is desirable because it enables us to arrange the corresponding 
eigenfunctions 


yfx), y 2 (x ),..., yn(x ),... (5) 

in their own natural order. As in the case of Example 1, the eigenfunctions 
are not unique, but with the boundary conditions we will be interested in, 
they are determined up to a nonzero constant factor. 

We now look for possible orthogonality properties of the sequence of 
eigenfunctions (5), and in the process of doing this, we will discover what 
types of boundary conditions are "suitable." Consider the differential equation 
(3) written down for two different eigenvalues X m and with y m and y n the 
corresponding eigenfunctions: 


„ dy m 

dx dx 


+[Kq+r]y m = 0 


and 


d 

dx 


V 


dy„ 

dx 


+ [X n q + r]y n =0. 


If we shift to the more compact prime notation for derivatives, then on mul¬ 
tiplying the first equation by y n and the second by y m , and subtracting, we 
find that 


Vnipy'm)' - y m {py'n)'+(K,~ K)qy m y n = o. 

We now move the first two terms to the right and integrate from a to b, using 
integration by parts, to obtain 


b 



= p(b)[ym(b)y'n(b) - y,,(b)iy' m (b)] - p(a)[y m (a)y' n (a) - y„(a)y' m (a)\. (6) 
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If we denote by W(x) the Wronskian determinant of the solutions y,„(x) and 
y„(x), which is defined by 


W(x) = 


y m (x) 

Vn(x) 


y'm(x) 

y'n(x) 


y m (x)y'n(x) - y n (x)y' m (x), 


then (6) can be written in the convenient form 

b 

(K,-K)jqy,„y„ dx = p(b)W(b)-p(a)W(a). (7) 

a 

We point out particularly that the integrations by parts in the calculation (6), 
and the consequent cancellations, are possible only because of the special 
form of the first term in the differential equation (3). 4 

We want the right side of (6) or (7) to vanish, so that we can obtain the 
orthogonality property 


b 

J* qy m y n dx = 0 for m * n. (8) 

a 

By looking at the right side of (6), we see that this will certainly happen if the 
boundary conditions required of a nontrivial solution of (3) are 

y{a) = 0 and y(b) = 0 


or 


y'(d) = 0 and y'(b) = 0. 

Each of these is a special case of the more general boundary conditions 

c 1 y(a) + c 2 y’{a) = 0 and d : ij(b) + d 7 y'(b) = 0 (9) 

where c v or c 2 # 0 and d } or d 2 / 0. To see that these boundary conditions 
really do make the right side of (7) vanish, suppose that the solutions y m (x) 
and y n (x) both satisfy the first condition (9), so that 

C\y m{a) + c 2 y’ m {a) = 0, 

c 1 y„(fl) + c 2 y'„(fl) = 0. 


4 Differential equations having this special form are called self-adjoint. See the problems below 
for an explanation of this terminology. 
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Since this system has a nontrivial solution c 1 c 2 , the coefficient determinant 
must vanish: 


y m (a) 

y*(«) 


y'n{a) 

y'n{a) 


= W(a) = 0. 


Similarly W(b) = 0, and it follows from this that the right side of (7) vanishes. 

Boundary conditions of the form (9) are called homogeneous boundary condi¬ 
tions. Their special feature is the fact that any sum of solutions of equation (3) 
that individually satisfy such boundary conditions will also satisfy the same 
boundary conditions. Any differential equation of the form (3) with homoge¬ 
neous boundary conditions is called a Sturm-Liouville problem. 

The significance of these ideas is that the orthogonality property (8) gives 
us a formal method for finding series expansions of functions/(x) in terms 
of the eigenfunctions of such a Sturm-Liouville problem. Formally, we are 
led to the following procedure. We assume that /(x) can be written in the 
form 


/ (*) = ayjfx) +« 2 y 2 (x)+ ■ ■ • a n y n (x) + ■■■ (10) 

Multiplying both sides of this by q(x)y n {x) and integrating term by term from 
a to b yields 


b 

| f (x)q(x)y„(x)dx 

a 


D D 

■■ fli j* q(x)y 1 (x)y n (x)dx + ■ ■ ■ + a n j q(x)[y„(x)] 2 dx + • 


=a 


n 



(ll) 


because of (8). With the coefficients a„ determined by (11), formula (10) is 
called an eigenfunction expansion off(x). 

A very important mathematical question now arises that is familiar to us 
from Chapter 6 and the earlier sections of this chapter—how do we know 
that the series (10) with coefficients determined by (11) really represents 
f(x)? And what does "represents" mean? Does it mean in the sense of point- 
wise convergence? Or mean convergence? Or perhaps some other concept 
altogether? We have seen in Chapter 6 how difficult some of these theoreti¬ 
cal problems are for ordinary Fourier series, which are the simplest of all 
eigenfunction expansions. Two further special cases that are particularly 
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important for applications to physics are concerned with the orthogonal 
sequences of the Legendre polynomials and the Bessel functions. These two 
sequences of functions, and their properties, and the associated eigenfunc¬ 
tion expansions, are the subject of the next chapter. 

Self-adjoint boundary value problems of the kind described above are 
called regular, because the interval [a,b] is finite and the functions p(x) and 
q(x) are positive and continuous on the entire interval. Singular problems are 
those in which one of these functions vanishes or becomes infinite at an 
endpoint, or the interval itself is infinite. Unfortunately, many of the more 
important problems are singular, and the theory must be correspondingly 
more complicated to cope with them . 5 


Example 2. Consider the important Legendre equation in its self-adjoint 
form, 


d_ 

dx 


(l-* 2 ) 


2\ d V 


dx 


+ Xy = 0, 


-1 < x < 1 . 


Here the function p(x)- 1 - x 2 vanishes at both endpoints. No boundary 
conditions of the usual kind are imposed at the endpoints x = ±l, but it is 
required that the solutions remain bounded near these points. It turns out 
that this happens only when X-n(n +1) for n = 0,1, 2,..., and the correspond¬ 
ing solutions are the Legendre polynomials P n (x). The details of this singular 
self-adjoint boundary value problem are found in Chapter 8 . 

Remark. We have done little more in this section than acquaint the student 
with some of the issues in this subject, and we have certainly not provided 
any substantive proofs. One of the first questions about any self-adjoint 
boundary value problem—Sturm-Liouville or otherwise—is this: Does there 
exists an adequate supply of eigenvalues and corresponding eigenfunctions? 
For the reader who is interested in these theoretical matters, a full and rig¬ 
orous proof of this existence theorem is given in Appendix A, but only for 
a somewhat special case of the regular Sturm-Liouville problem described 
above. 


Note on Liouville. Joseph Liouville ( 1809 - 1882 ) was a highly respected pro¬ 
fessor at the College de France in Paris and the founder and editor of the 
Journal des Mathematiques Pares et Appliquees, a famous periodical that played 
an important role in French mathematical life throughout the nineteenth 
century. For some reason, however, his own remarkable achievements as a 


5 Full treatments can be found in E. C. Titchmarsh, Eigenfunction Expansions, 2 vols, Oxford 
University Press, 1946 and 1958; and in E. A. Coddington and N. Levinson, Theory of Ordinary 
Differential Equations, McGraw-Hill, New York, 1955. 
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creative mathematician have not received the appreciation they deserve. The 
fact that his collected works have never been published is an unfortunate 
and rather surprising oversight on the part of his countrymen. 

He was the first to solve a boundary value problem by solving an equiva¬ 
lent integral equation, a method developed by Fredholm and Hilbert in 
the early 1900s into one of the major fields of modern analysis. His inge¬ 
nious theory of fractional differentiation answered the long-standing ques¬ 
tion of what reasonable meaning can be assigned to the symbol d n y/dx n 
when n is not a positive integer. He discovered the fundamental result in 
complex analysis now known as Liouville's theorem —that a bounded entire 
function is necessarily a constant—and used it as the basis for his own 
theory of elliptic functions. There is also a well-known Liouville theorem 
in Hamiltonian mechanics, which states that volume integrals are time- 
invariant in phase space. His theory of the integrals of elementary func¬ 
tions was perhaps the most original of all his achievements, for in it he 
proved that such integrals as 



as well as the elliptic integrals of the first and second kinds, cannot be 
expresed in terms of a finite number of elementary functions. 6 

The fascinating and difficult theory of transcendental numbers is another 
important branch of mathematics that originated in Liouville's work. The 
irrationality of n and e —that is, the fact that these numbers are not roots 
of any linear equation ax + b = 0 whose coefficients are integers—had been 
proved in the eighteenth century by Lambert and Euler. In 1844 Liouville 
showed that e is also not a root of any quadratic equation with integral coef¬ 
ficients. This led him to conjecture that e is transcendental, which means that 
it does not satisfy any polynomial equation 


a n x n + a n _ 1 x n ~ l + ■ ■ • + rtjX + a 0 - 0 


with integral coefficients. His efforts to prove this failed, but his ideas con¬ 
tributed to Hermite's success in 1873 and then to Lindemann's 1882 proof 
that it is also transcendental. Lindemann's result showed at last that the age- 
old problem of squaring the circle by a ruler-and-compass construction is 
impossible. One of the great mathematical achievements of modern times 
was Gelfond's 1929 proof that e % is transcendental, but nothing is yet known 


6 See D. G. Mead, "Integration," Am. Math. Monthly, vol. 68, pp. 152-156 (1961). For additional 
details, see G. H. Hardy, The Integration of Functions of a Single Variable, Cambridge University 
Press, London, 1916; or J. F. Ritt, Integration in Finite Terms, Columbia University Press, 
New York, 1948. 
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about the nature of any of the numbers n + e, ne or n e . Liouville also discov¬ 
ered a sufficient condition for transcendence and used it in 1844 to produce 
the first examples of real numbers that are provably transcendental. One of 
these is 


1 _ 1 + 1 + 1 

io r+ icF + io fr+ 

n =1 


0.11000100 •••. 


His methods here have also led to extensive further research in the twentieth 
century. 7 


Problems 

1. The differential equation P(x)y" + Q(x)y' +R(x)y=0 is called exact if it 
can be written in the form [P(x)y']' + [S(x)y]' = 0 for some function S(x). 
In this case the second equation can be integrated at once to give the 
first order linear equation P(x)y' + S(x)y-c v which can then be solved 
by the method of Section 10. By equating coefficients and eliminating 
S(x), show that a necessary and sufficient condition for exactness is 
P"(x ) - Q'(x) + R(x) = 0. 

2. Consider the Euler equidimensional equation that arose in Section 42, 

x 2 y" +xy' - n 2 y = 0, 

where n is a positive integer. Find the values of n for which this equation 
is exact, and for these values find the general solution by the method 
suggested in Problem 1. 

3. If the equation in Problem 1 is not exact, it can be made exact by mul¬ 
tiplying by a suitable integrating factor p(x). Thus, p(x) must satisfy 
the condition that the equation p{x)P{x)y" + p(x)Q(x)y' + u(x)R(x)y = 0 is 
expressible in the form [ \i(x)P(x)yf + [S(x)y]' = 0 for some function S(x). 
Show that p(x) must be a solution of the adjoint equation 

P(x)p" + [2P'(x) - Q(x)]p' + [P"(x) - Q'(x) + P(x)]p = 0. 

In general (but not always) the adjoint equation is just as difficult to 
solve as the original equation. Find the adjoint equation in each of the 
following cases: 


7 An impression of the depth and complexity of this subject can be gained by looking into A. 
O. Gelfond, Transcendental and Algebraic Numbers, Dover, New York, 1960. 
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(a) Legendre's equation: (1 - x 2 )y" - 2 xy' + p(p + l)y = 0; 

(b) Bessel's equation: x 2 y" + xy' + (x 2 - p 2 )y = 0; 

(c) Chebyshev's equation: (1 - x 2 )y" - xy' +p 2 y = 0; 

(d) Hermite's equation: y" - 2 xy' + 2 py = 0; 

(e) Airy's equation: y" + xy = 0; 

(f) Laguerre's equation: xy" + (1 - x)y' +py = 0. 

4. Solve the equation 


y"-[ 2 *+!)y'- 4 y = ° 

by finding a simple solution of the adjoint equation by inspection. 

5. Show that the adjoint of the adjoint of the equation P(x)y" + Q(x)y' + 
R(x)y = 0 is the original equation. 

6 . The equation P(x)y" + Q(x)y' + R(x)y = 0 is called self-adjoint if its adjoint is 
the same equation (except for notation). 

(a) Show that this equation is self-adjoint if and only if P'(x) = Q(x). In 
this case the equation becomes 

P(x)y" + P'(x)y' + R(x) = 0 


or 


[P(x)y']'+R(x)y= 0, 

which is the standard form of a self-adjoint equation. 

(b) Which of the equations in Problem 3 are self-adjoint? 

7. Show that any equation P(x)y" + Q{x)y' + R(x)y = 0 can be made self- 
adjoint by multiplying through by 

\(Q/P)dx 

P £ 

8 . Using Problem 7 when necessary, put each equation in Problem 3 into 
the standard self-adjoint form described in Problem 6 . 

9. Consider the regular Sturm-Liouville problem consisting of 
equation (3) with the boundary conditions (9). Prove that every eigen¬ 
function is unique except for a constant factor. Hint: Let y = u(x) and 
y = v(x) be eigenfunctions corresponding to a single eigenvalue X, 
and use their Wronskian to show that they are linearly dependent 
on [a,b]. 
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10. Consider the following self-adjoint boundary value problem on [a,h\. 


d_ 

dx 


p{x) 


dy_ 

dx 


y(«) = y(b) 


+ [Xq(x) + r(x)]y = 0, 
and y'(a) = y\b), 


where p(a)-p(b). It is assumed that p(x), p\x), q(x), and r(x) are continu¬ 
ous and that p(x) > 0 and q(x) > 0 for a < x <b. This problem is then said 
to have periodic boundary conditions. It can be proved that there exists a 
sequence of eigenvalues 

A, 0 < < A, 2 < • • • < < X n+1 < ■ ■ ■ 


such that 


lim^„ = co. 

n—>oo 


(a) By examining the calculation (6), show that eigenfunctions corre¬ 
sponding to distinct eigenvalues are orthogonal with respect to the 
weight function q(x). 

(b) In this case, however, to each eigenvalue there may correspond 
either one or two linearly independent eigenfunctions. Verify this 
by finding the eigenvalues and corresponding eigenfunctions for 
the problem y" + Xy = 0, where y(-jt) = y(n) and y'(-jt) = y'{n). 

(c) Why can this problem not have more than two independent eigen¬ 
functions associated with a particular eigenvalue? 


Appendix A. The Existence of Eigenvalues and Eigenfunctions 

The general theory of eigenvalues, eigenfunctions, and eigenfunction expan¬ 
sions is one of the deepest and richest parts of modern mathematics. In this 
appendix we confine our attention to a small but significant fragment of this 
broad subject. Our primary purpose is to prove that any boundary value 
problem of the form 40—(23)—which arose in connection with the nonhomo- 
geneous vibrating string—has eigenvalues and eigenfunctions with proper¬ 
ties similar to those encountered in Section 40. Once this is accomplished, we 
will find that a simple change of variable allows us to extend this result to a 
considerably more general class of problems. 

We begin with several easy consequences of the Sturm comparison 
theorem. 
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Lemma 1. Let y(x) and z(x) be nontrivial solutions of 

y" + q(x)y = 0 


and 


z" + r(x)z - 0, 

where q(x) and r(x) are positive continuous functions such that q(x)>r(x). Suppose 
that y(x) and z(x) both vanish at a point b 0 , and that z(x) has a finite or infinite num¬ 
ber of successive zeros b y b 2 , ■ ■., b n , ...to the right of b 0 . Then y(x) has at least as 
many zeros as z(x) on every closed interval [b 0 ,b n ]; and if the successive zeros ofy(x) 
to the right of b 0 are a y a 2 , ..., a n , ..., then a n <b n for everyn. 

Proof. By the Sturm comparison theorem (Theorem 25-B), y(x) has at least 
one zero in each of the open intervals (b 0 ,b x ), (b u b 2 ),..., (b n _ u b n ), and both state¬ 
ments follow at once from this. 


Lemma 2. Let q(x) be a positive continuous function that satisfies the inequalities 

0 <m 2 <q(x)<M 2 

on a closed interval [a,b]. Ify(x) is a nontrivial solution ofy" + q(x)y = 0 on this inter¬ 
val, and ifx x and x 2 are successive zeros ofy(x), then 


71 71 

- <X 2 ~Xi <—. 

M m 


( 1 ) 


Furthermore, ify{x) vanishes at a and b, and at 77-1 points in the open interval ( a,b), 
then 


m(b-a) <n< M(b-a) ^ 

n n 

Pro of. To prove (1), we begin by comparing the given equation with z" + m 2 z - 0. 
A nontrivial solution of this that vanishes at x 2 is z(x) = sin m(x - ay). Since the 
next zero of z(x) is x, + k/iu, and Theorem 25-B tells us that x 2 must occur 
before this, we have x 2 <x l + n/m or x 2 - x l <n/m. A similar argument gives 
the other inequality in (1). 

To prove (2), we first observe that there are n subintervals between the 
n +1 zeros, so by (1) we have b - a = the sum of the lengths of the n subin- 
tervals < n(7i/m), and therefore m(b - d)/n<n. In the same way we see that 
b - a>n(n/M), so 77<M(b - a)/n. 

Our main preliminary result is the next lemma. 
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Lemma 3. Let q(x) be a positive continuous function and consider the differential 
equation 


y" + Xq(x)y =0 (3) 

on a closed interval [a,b\. For each X, let yfx) be the unique solution of equation (3) 
which satisfies the initial conditions yfd)-0 and y' k (a) = l. Then there exists an 
increasing sequence of positive numbers 

X k <X 2 < <X„< 

that approaches °° and has the property that y k (b)- 0 if and only ifX equals one 
of the X n . Furthermore, the function y kn (x) has exactly n - 1 zeros in the open 
interval (a,b). 

Proof. It is clear by Theorem 24-B that yfx) has no zeros to the right of a when 
X < 0. Our plan is to watch the oscillation behavior of yfx) as X increases 
from 0. We begin with the observation that by the continuity of q(x) there 
exist positive numbers m and M such that on [a,b] we have 0 < m 2 < q(x) < M 2 . 
Thus, in the sense made precise in Section 25, yfx) oscillates more rapidly on 
[a,b] than solutions of 


y" + Xm 2 y = 0, 


and less rapidly than solutions of 


y" + XM 2 y = 0. 


By Lemma 2, when X is positive and small (so small that n/JXM >b-a ) the 
function yfx) has no zeros in [a,b\ to the right of a; and when X increases to 
the point where, nj y[XM <b-a then yfx) has at least one such zero. Similarly, 
as X increases to the number of zeros of y fx) in [a,b\ tends toward It fol¬ 
lows from Lemma 1 that the nth zero of yfx) to the right of a moves to the left 
as X increases, and we shall take it for granted (it can be proved) that this zero 
moves continuously Consequently, as X starts at 0 and increases to there 
are infinitely many values X lr X 2 , ..., X n , ... for which a zero of yfx) reaches b 
and subsequently enters the interval, so that y kn (x) vanishes at a and b and 
has n - 1 zeros in (a,b). To show that the sequence X v X 2 , ..., X nr ... approaches 
we appeal to the inequalities (2), which in this case become 

Jx^m(b - a) yJX^M(b - a) 


n 


K 
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or 


2 2 2 2 
n n „ n n 
<X n < 


M 2 (b-a) 2 


m 2 (b-a) 


2 ' 


Equation (3) is the special case of the Sturm-Liouville equation 


d_ 

dx 


ax 


+ A,^(x)y = 0 


(4) 


in which p(x) = 1. We assume here that p(x) and q(x) are positive continuous 
functions on [a,b\, and also that p(x) has a continuous derivative on this inter¬ 
val. If we change the independent variable in (4) from x to a new variable zv 
defined by 



dt 

WY 


so that 


dw _ 1 
dx p(x) 


and 


dy dy dzv _ 1 dy 

dx dzv dx p(x) dzv' 


then (4) takes the form 


Yl I + Xq 1 (zv)y = 0, (5) 

where qfw) is positive and continuous on the transformed interval 0 < w < 
c-zv(b). On applying Lemma 3 to equation (5), we immediately obtain the 
following statement about (4). 


Theorem A. Consider the boundary value problem 


d_ 

dx 



dy 

dx 


+ Xq(x)y = 0, 


y(a) = y(b) = 0, 


( 6 ) 


zvhere p(x) and q(x) satisfy the conditions stated above. Then there exists an increas¬ 
ing sequence of positive numbers 
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that approaches °° and has the property that (6) has a nontrivial solution if and only 
ifX equals one of the X n . The solution corresponding to X-X n is unique except for an 
arbitrary constant factor, and has exactly n - i zeros in the open interval ( a,b ). 


One final remark is in order. As pointed out in Section 43, we usually refer 
to (6) as a regular Sturm-Liouville problem because the interval is finite 
and the functions p(x) and q(x) are positive and continuous on the entire 
interval. Singular problems arise when the interval is infinite, or when it is 
finite and p(x) or q(x) vanishes or is discontinuous at one or both endpoints. 
These problems are considerably more difficult, and of course are not cov¬ 
ered by our discussion in this appendix. Unfortunately, many of the most 
interesting differential equations are singular in this sense. We mention 
Legendre's equation 


d_ 

dx 


(1-x 2 ) 


dy 

dx 


+ Xy = 0, 


-1 < x < 1; 


Chebyshev's equation 


d 

dx 


(1-x ) 


2 \i /2 dy 


dx 


+ A.(l - x 2 ) 1/2 y = 0, -1 < x < 1; 


Hermite's equation 


d_ 

dx 


dy 

dx 


+ Xe x y = 0, -oo < x < oo; 


and Laguerre’s equation 


d !_ 

dx 


xe 


-x dy 

dx 


+ Xe x y = 0, 0 < x < c 


These equations appeared in Chapter 5, where they were studied from an 
entirely different point of view. 










Chapter 8 

Some Special Functions of 
Mathematical Physics 


44 Legendre Polynomials 

This section and the next are entirely devoted to the technical task of defin¬ 
ing the Legendre polynomials and establishing a number of their spe¬ 
cial properties. It is natural to wonder about the purpose of this elaborate 
machinery, and more generally, why we care about Legendre polynomials at 
all. The simplest answer is that the Legendre polynomials have many impor¬ 
tant applications to mathematical physics, and these applications depend on 
this machinery. For the benefit of readers who wish to see for themselves, 
the physical background and several typical applications are discussed in 
Appendix A. There is another answer, however, which is less utilitarian and 
applies equally to our subsequent treatment of Bessel functions. It is that 
the study of specific classical functions and their individual properties pro¬ 
vides a healthy counterpoise to the abstract ideas that sometimes seem to 
dominate contemporary mathematics. In addition, we mention several items 
that arise naturally in the context of this chapter which we hope will be of 
interest to all students of mathematics: the gamma function and the formula 

! = \fm; Lambert's continued fraction for the tangent. 
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whose sums were discovered by Euler in the early eighteenth century and 
which appear again in a surprising way in connection with the zeros of 
Bessel functions. 

Now for the Legendre polynomials themselves, which we approach by 
way of the hypergeometric equation. 1 

In Section 28 we used Legendre's equation to illustrate the technique of 
finding power series solutions at ordinary points. Lor reasons explained in 
Appendix A, we now write this equation in the form 

(1 - x 2 )y" - 2 xy' + n(n + 1 )y = 0, (1) 

where n is understood to be a non-negative integer. The reader will recall that 
all the solutions of (1) found in Section 28 are analytic on the interval -1 < x < 1. 
However, the solutions most useful in the applications are those bounded 
near x = 1, and for convenience in singling these out we change the indepen- 
dent variable from x to t- —(1 - x). This makes x = 1 correspond to t = 0 and 
transforms (1) into ^ 

t( 1 - t)y" + (1 - 2 t)y' + n(n + l)y = 0, (2) 

where the primes signify derivatives with respect to t. This is a hypergeo¬ 
metric equation with a = -n, b = n + 1, and c = 1, so it has the following poly¬ 
nomial solution near t - 0: 


yi = F(~n, n + 1,1, t). (3) 

Since the exponents of (2) at the origin are both zero (m, = 0 and m 2 = 1 - c = 0), 
we seek a second solution by the method of Section 16. This second solution 
is y 2 = vy v where 


v = 


1 -f Pdt 
-re 

yi 


1 j(2t-l)/t(l-t)dt 

2 ^ 

yi 


i 

yif(i-t) 


l l 
t yl(l-t) 


by an elementary integration. Since y\ is a polynomial with constant term 1, 
the bracketed expression on the right is an analytic function of the form 1 + 
af + a 2 t 2 + ■ • ■, and we have 


1 Adrien Marie Legendre (1752-1833) encountered his polynomials in his research on the 
gravitational attraction of ellipsoids. He was a very good French mathematician who had 
the misfortune of seeing most of his best work—in elliptic integrals, number theory, and the 

method of least squares—superseded by the achievements of younger and abler men. For 
instance, he devoted 40 years to his research on elliptic integrals, and his two-volume treatise 
on the subject had scarcely appeared in print when the discoveries of Abel and Jacobi revolu¬ 
tionized the field completely. He was very remarkable for the generous spirit with which he 
repeatedly welcomed newer and better work that made his own obsolete. 
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v — —h a\ + a21 + • • •. 

This yields o = log t + af + ■ ■ ■, so 

3/z = 3/i(log t + af + ■ ■ •) 

and the general solution of (2) near the origin is 

y = c 1 yi + c 2 y 2 - 


(4) 


Because of the presence of the term log f in y 2 , it is clear that (4) is bounded 

\ 

near t - 0 if and only if c 2 = 0. If we replace t in (3) by —(1 - x), it follows that 

the solutions of (1) bounded near x-1 are precisely constant multiples of the 

\ 

polynomial F[-n,n + 1,1, —(1 - x)]. 

This brings us to the fundamental definition. The nth Legendre polynomial 
is denoted by P„(x) and defined by 


P n (x) = F 


—n, n + 1,1, — (1 — x) 


= 1 + 


(-n)(« + l) f 1-x 

( l !) 2 


(-n)(-n +1 )(n +1 )(n + 2 )( 1-x 

w 

(-n)(-n + 1) • • -[-n + (n- l)](n +1 )(n + 2) • • -(2 n) 


(niy 


= 1 + 


+ ••• + 


1-x 
2 

n(n + 1) 




( 1 !) 2 
(2 n)\ 
{n\f2 


( 2 !) 2 




(5) 


We know from our work in Section 28 that P„(x) is a polynomial of degree 
n that contains only even or only odd powers of x according as n is even or 
odd. It can therefore be written in the form 

P n (x) = a n x" + fl„_ 2 x"- 2 + fl„_ 4 x n - 4 + ■ • -, (6) 

where this sum ends with a Q if n is even and a t x if n is odd. It is clear from (5) 
that P„(l) = 1 for every n, and in view of (6) we also have P„(-l) = (-1)". 

As it stands, formula (5) is a very inconvenient tool to use in studying 
P„(x), so we look for something simpler. We could expand each term in 
(5), collect like powers of x, and arrange the result in the form (6), but this 
would be unnecessarily laborious. What we shall do is notice from (5) that 
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a n - (2n)\/(n\) 2 2" and calculate a n _ 2 , a n _ 4/ ... recursively in terms of a n . What is 
needed here is formula 28-(9) with p replaced by n and n by k - 2: 


&k—- 


(n-k + 2)(n + k-l) 
(k-1 )k 


0-k-2 


or 

a k{k-\) 

k ~ 2 (n-k + 2)(n + k-l) k ' 


When k = n,n- 2 ,this yields 


a 


a 


n(n-\) 

1-2 2(2n-l) 

(«-2)(«-3) 

! ” 4 4(2n - 3) ”" 2 

_ n(n -1 )(n - 2 )(n - 3) 
2-4(2n-l)(2«-3) 


a >!, 


and so on, so (6) becomes 




x »(»-!) h-2 

" 2(2n -1) 


(n\) 2 l" 

| n(n-l)(n-2)(n-3) x „_4 | 

2 • 4(2n - l)(2n - 3) 

n(n -!)• • -(n -2k + 1) 


+(-l) 


2 k k !(2 n - 1)(2 n-3)—(2n-2k + l) 


x n - 2k + ■■■]■ 


( 7 ) 


Since 


n (n -1) - • • (n - 2fc +1) = -— 

(n-2k)\ 


(2 n -2k + 1)(2 n - 2k + 3) • ••(2n - 3)(2« -1) 

_ (2 n-2k + 1)(2 n-2k + 2)(2 n -2k + 3)—{2n- 3)(2 n - 2)(2 n- 1)2 n 
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the coefficient of x n ~ 2k in (7) is 


(-l) k 


n\ (2n-2k)\2 k n\ _ k (n\f(2n-2k)\ 


2 k\(n-2k)\ (2 n)\(n-k)\ 


= (- 1 ) 


kl(2n)\(n-k)\(n-2k)\ 


This enables us to write (7) as 

In/2] 


P„(X)^(- if- 


(2n-2k)\ xll _ 2k 


k =0 


2 n k\(n-k)\(n-2k)\ 


( 8 ) 


where [n/2] is the usual symbol for the greatest integer <n/2. We continue 
toward an even more concise form by observing that 


[n/2] 


Pn(x) = ^ 

k =0 
[n/2] 

-X 


k =0 


(-If (2n-2fc)! a 
2 n k\(n-k)\ (n-2k)\ ' 

(-If d" x2n _ 2k 

Tk\(n-k)\ dx n 


i jn [ " /2) i 

——— y ——— 

2”n! dx" j-Jk\(n-k)V ' v 


If we extend the range of this sum by letting k vary from 0 to n —which 
changes nothing since the new terms are of degree <n and their nth deriva¬ 
tives are zero—then we get 


P„(x) = 


1 d n 
Tn\ dx n 


k=0 V J 


and the binomial formula yields 


Pn(x) = 


1 d" 
2" n ! dx n 


(x 2 -l)". 


(9) 


This expression for P„(x) is called Rodrigues' formula. 2 It provides a relatively 
easy method for computing the successive Legendre polynomials, of which 
the first few (Figure 58) are 


2 Olinde Rodrigues (1794-1851) was a French banker who came to the aid of Claude Flenri 
Saint-Simon (the founder of socialism) in his destitute old age, supported him during the last 
years of his life, and became one of his earliest disciples. He discovered the above formula 
in 1816, but soon thereafter became interested in the scientific organization of society and 
never returned to mathematics. The term "Rodrigues' formula" is often applied by transfer¬ 
ence to similar expressions for other classical polynomials of which Rodrigues himself knew 
nothing. 
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FIGURE 58 


Pq(x) = 1, Pfx) = x, 

P 2 (x) = |(3x 2 -1), P 3 (x) = ^(5x 3 - 3x). 

An even easier procedure is suggested in Problem 2, and a more significant 
application of (9) will appear in the next section. 


Problems 

1. The function on the left side of 1 

. ^ — = P Q (x) + P 1 (x)t + P 2 (x)t 2 + • • • + P„(x)t n +■■■ 

\jl-2xt + t 2 


is called the generating function of the Legendre polynomials. Assume 

that this relation is true, and use it 

(a) to verify that P„(l) = 1 and P„(-l) = (-1)"; 


(b) to show that P 2 „+i(0) 


0 and P 2 „(0) = (-!)" 


l-3---(2n-l) 


2 "n\ 
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2. Consider the generating relation in Problem 1, 

1 00 

=yy n ( X r. 

1-2 xt + t 2 


Vu 


(a) By differentiating both sides with respect to t, show that 


( x-t)S^P n (x)t n = (1 -2 xt + t 2 j*}TnP„(x)t n 


(b) Equate the coefficients of t" in (a) to obtain the recursion formula 

(n + l)P, !+1 (x) = (In + 1 )xP n (x) - nP„_fx). 

(c) Assume that P 0 (x ) = 1 and P/x) = x are known, and use the recursion 
formula in (b) to calculate P 2 (x), P 3 (x), P 4 (x), and P 5 (x). 

3. Establish the generating relation of Problems 1 and 2 by the following 
steps: 

(a) Use the binomial series to write 


[l-t(2x-t)T in 


l + U(2x-t) + ^t 2 (2x-tf + ■■■ 


+ 


1,3 7 (2n 3) t n ~\2x-ty- 1 

r-fn-iy. v ' 


+ 1' 3 "‘(2” 1) t n (2x-t) n + 
2 n n\ 


(b) It is clear that t n can occur only in terms out to and including the 
last term written in (a). By expanding the various powers of 2x - t, 
show that the total coefficient of t" is 


l-3-(2n-l) l-3-(2n-3) n-1 2 

Tn\ 1 ’ 2"-\n-l)\ 1! ' ’ 

, l-3-(2«-5) (»-2)(n-3) 4 

r- 2 (n-2)1 2! ' ’ 

(c) Show that the sum in (b) is P n (x) as given by (8). 

4. This problem constitutes a direct verification that if P n (x) is defined 
by formula (9), then it satisfies Legendre's equation (1) and has 
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the property that P„(l) = 1 Consider the polynomials of degree n 
defined by 


(a) If w - (x 2 -1)", then (x 2 -1 )w' - 2nxw = 0. By differentiating this equa¬ 
tion k + 1 times, show that 


(x 2 -l)o# +2 > + 2(k +l)xo# +1) + (k + 1 )kwM - 2nxw (k+1) - 2(/c + 1 )nw^> = 0, 


and conclude that y = w <n> is a solution of equation (1). 

(b) Put u = (x — 1)" and v = (x + 1)" and use the formula 

y = = w ( "> v + + ■ ■ ■ + nu®hf n ~ r> + uv^ 

to show that y{ 1) = n\2 n . 


45 Properties of Legendre Polynomials 

In the previous section we defined the sequence of Legendre polynomials 

p 0 (x), pm pm ■ ■ v p,M ■■■■ (!) 

The reader is aware that these polynomials have a number of applications, 
which range from mathematical physics to the theory of approximation. We 
now discuss the fundamental ideas on which some of these applications 
depend. 

Orthogonality. The most important property of the Legendre polynomials 
is the fact that 


i 

j ’p m (x)P n (x) dx = • 

-i 


0 

2 

2n + l 


if m * n, 
if m = n. 


( 2 ) 


This is often expressed by saying that (1) is a sequence of orthogonal functions 
on the interval -1 < x < 1. We shall explain the significance of this property 
after we prove it. 

Let/(x) be any function with at least n continuous derivatives on the inter¬ 
val -1 < x < 1, and consider the integral 
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1 = 


j 'f(x)P n (x)dx. 
-1 

Rodrigues' formula enables us to write this as 


I = - 

and an integration by parts gives 
1 


1 = 


Tn\ 


/(x)-^(x 2 -l)" 


i , 

- \f(x) - -(x 2 -ly'dx. 

l n n'.i J KJ dx n - lK ' 


The expression in brackets vanishes at both limits, so 

jbh x) £*^- irdx: 


! = ■ 


and by continuing to integrate by parts, we obtain 


1 = 


{ ^\f w (x)(x 2 -lf'dx. 


If/(x) = P Jx) with m < n, then/ i " , (x) = 0 and consequently 1=0, which proves 
the first part of (2). To establish the second part, we put /(x) = P n (x). Since 

Pf \x) = (2n)\/2 n n\, it follows that 


I = 


(2 n) 

2 2n (n\) 




dx 


2(2 n)\ 
2 2n (n\) 2 


1 

J*(l-x 2 )" dx. 


(3) 


If we change the variable by writing x = sin 0, and recall the formula (proved 
by an integration by parts) 


| cos 2,!+1 0 dO = —-— cos 2 " 0 sin 0 + f 

2n + 1 2n + l J 


cos 2 " 1 0dQ, 


(4) 
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then the definite integral in (3) becomes 


rc /2 
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cos 2 ” +1 0 d0 = 


In 


In + 


u/2 

ll 
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9 d6 


2 n 2 n - 2 
2 n + 1 2n -1 
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ti/2 

J* cos 9 d0 

o 


2"n! _ 2 2 "(«!) 2 

l-3---(2n-l)(2n + l) “ (2n)!(2n + l)' 


We conclude that in this case I = 2/(2 n + 1), and the proof of (2) is complete. 


Legendre series. As we illustrate in Appendix A, many problems of poten¬ 
tial theory depeind on the possibility of expanding a given function in a 
series of Legendre polynomials. It is easy to see that this can always be done 
when the given function is itself a polynomial. For example, formulas 44-(10) 
tell us that 

l = PoM, x = P 1 (x), x 2 =| + |p 2 (x) = |p 0 (x) + |p 2 (x) 

x 3 = |x + |p3(x) = |p 1 (x) + |p 3 (x); 

DO D D 

and it follows that any third-degree polynomial p(x) - b 0 + b x x + b 2 x 2 + b 3 x 3 
can be written as 


p(x) = b 0 P 0 (x) + btPfx) + b 2 


|p 0 (x) + |p 2 (x) 


+ bri 


|p 1 (x) + |p 3 (x) 

D D 


3b 


2bj 


2h 


— b 3 + —— P 0 (x)+ b x —— Pi(x) + —— P 2 (x)-i— —Psix) 


a 

flnPn{x)' 


More generally, since P„(x) is a polynomial of degree n for every positive 
integer n, a simple extension of this procedure shows that x" can always be 
expressed as a linear combination of P 0 (x), P,(x), ..., P„(x), so any polynomial 
p(x) of degree k has an expansion of the form 
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k 

p(x)=y\„p„(x). 

n =0 


An obvious problem that arises from these remarks—and also from the 
demands of the applications—is that of expanding an "arbitrary" function 
f(x) in a so-called Legendre series: 


/(x) = y\„P„(x). (5) 

n=0 


It is clear that a new procedure is needed for calculating the coefficients a n in 

(5) , and the key lies in formulas (2). 

If we throw mathematical caution to the winds, and multiply (5) by P m (x) 
and integrate term by term from -1 to 1, then the result is 

1 oo 1 

|/(x)P„,(x) dx = ^a n \ux)P n (x) dx ; 

-i «=o -i 

and in view of (2), this collapses to 

1 

[f(x)P m (x)dx= 2a ' n 
J 2m +1 

-l 

We therefore have the following formula for the a n in (5): 

U " = (” + 1 ) dx - ( 6 ) 

These manipulations are easy to justify if/(x) is known in advance to have 
a series expansion of the form (5) and this series is integrable term by term 
on the interval -1 < x < 1. Both conditions are obviously satisfied when/(x) is 
a polynomial; but in the case of other types of functions we have no way of 
knowing this, and our conclusion that the coefficients a n in (5) are given by 

(6) is of doubtful validity. Nevertheless, these formal procedures are highly 
suggestive, and can lead to legitimate mathematics if we ask the following 
question. If the a„ are defined by formula (6) and then used to form the series 
(5), for what kinds of functions/(x) will these a n exist and the expansion (5) 
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be valid? This question has an answer, but this is not the place to go into 
details. 3 

The possibility of expansions of the form (5) obviously depends in a cru¬ 
cial way on the orthogonality property (2) of the Legendre polynomials. 
This is an instance of the following general phenomenon, which is often 
encountered in the theory of special functions. If a sequence of functions 
4> 1 ( x ), W*)/ • • v Jyfx), ... defined on an interval a< x<b has the property that 



then the <|> H are said to be orthogonal functions on this interval. Just as above, 
the general problem that arises in connection with a sequence of this kind 
is that of representing "arbitrary" functions/(x) by expansions of the form 



and a formal use of (7) suggests that the coefficients a n ought to be given by 



a 


Additional examples occur in Appendices B and D of Chapter 5, where the 
orthogonality (with respect to suitable weight functions) of the Hermite 
polynomials and Chebyshev polynomials is briefly mentioned. The satis¬ 
factory solution of this group of problems was one of the main achieve¬ 
ments of pure mathematics in the nineteenth and early twentieth centuries. 
Also, Chapter 6 contains a fairly full treatment of the classical problem that 


3 The answer we refer to—often called the Legendre expansion theorem —is easy to understand, 
but its proof depends on many properties of the Legendre polynomials that we have not 
mentioned. This theorem makes the following statement: If both/(r) and/'(r) have at most a 
finite number of jump discontinuities on the interval -1 < x < 1, and if f(x~) and/(r+) denote the 
limits of/(x) from the left and from the right at a point x, then the a„ exist and the Legendre 
series converges to 


\ [/(*-) +/(*+)] 


for - 1 < x < 1, to /(- 1 +) at x = -l, and to /(I -) at x = 1-and in particular, it converges to 
f(x) at every point of continuity. See N. N. Lebedev, Special Functions and Their Applications, 
pp. 53-58, Prentice-Hall, Englewood Cliffs, N.J., 1965. 
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underlies all of these ideas—that of expanding suitably restricted functions 
in Fourier series. 


Least squares approximation. Let/(x) be a function defined on the interval 
-1 < x < 1, and consider the problem of approximating fix) as closely as pos¬ 
sible in the sense of least squares by polynomials p(x) of degree <n. If we 
think of the integral 


I = \[f(x)-P(x)fdx (8) 

-i 

as representing the sum of the squares of the deviations of p(x) from fix), then 
the problem is to minimize the value of this integral by a suitable choice of 
p(x). It turns out that the minimizing polynomial is precisely the sum of the 
first n + 1 terms of the Legendre series (5), 


p(x) = a 0 P 0 (x) + ■ • • + a n P n (x), 


where the coefficients are given by (6). 

To prove this, we use the fact that all polynomials of degree <n are 
expressible in the form b 0 P 0 (x) + ■ ■ ■ + b„P n (x). The integral (8) can therefore 
be written as 


= | f{x)-%P k {x) 

-lL k=0 

1 n .. 

= j f(x) 2 dx + X 2^y # ~ 2 2> (*) dx 

_1 k =0 ^ k =0 |__i 

= [f(x) 2 dx + y 2 bi - 2 V b k 2cik 
' Lu 2k+ 1 Au k 2k + 1 

_1 k =0 k =0 


2 

-a k . 


Since the a k are fixed and the b k are at our disposal, it is clear that I assumes 
its minimum value when b k - a k for k = 0,The only hypothesis required 
by this argument is that f(x) and fix) 2 must be integrable. If the function 
fix) is sufficiently well behaved to have a power series expansion on the 
interval -1 < x < 1 , then most students assume that the "best" polynomial 











406 


Differential Equations with Applications and Historical Notes 


approximations to f(x) are given by the partial sums of this power series. 
The result we have established here shows that this is false if our criterion is 
approximation in the sense of least squares. 


Problems 

1. Verify formula (4). 

2. Legendre's equation can also be written in the form 

-^-[(l-x * 1 2 3 )y'] + n{n + l)y=0. 
ax 

so that 

~T~ [(1 - x 2 )P' m ] + m(m + 1)P,„ = 0 
ax 

and 


^[(l-x 2 )P'] + n(n + l)P„=0. 
ax 

Use these two equations to give a proof of the first part of formula (2) 
that does not depend on the specific form of the Legendre polynomials 
Hint: Multiply the first equation by P„ and the second by P m , subtract, 
and integrate from -1 to 1. 

3. If the generating relation given in Problems 1 and 2 of Section 44 is 
squared and integrated from x = - 1 to x = 1 , then the first part of (2) 
implies that 


1 


dx 

l-2xt + t 2 



Establish the second part of (2) by showing that the integral on the left 
has the value 
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4. Find the first three terms of the Legendre series of 



(b) /(x) = eh 

5. If p(x) is a polynomial of degree n> 1 such that 


jx k p(x) dx = 0 fork = 0,1,..., n-1. 


show that p(x) = cP„(x ) for some constant c. 

6. If P n (x) is multiplied by the reciprocal r of the coefficient of x", then 
the resulting polynomial rP n (x) has leading coefficient 1. Show that this 
polynomial has the following minimum property: Among all polyno¬ 
mials of degree n with leading coefficient 1, rP„(x) deviates least from 
zero on the interval -1 < x < 1 in the sense of least squares. 


46 Bessel Functions. The Gamma Function 

The differential equation 


x 2 y" + xy' + (x 2 - p 2 )y = 0, 


(1) 


where p is a non-negative constant, is called Bessel's equation, and its solu¬ 
tions are known as Bessel functions. These functions first arose in Daniel 
Bernoulli's investigation of the oscillations of a hanging chain (Problem 
40-4), and appeared again in Euler's theory of the vibrations of a circular 
membrane and Bessel's studies of planetary motion. 4 More recently, Bessel 
functions have turned out to have very diverse applications in physics and 
engineering, in connection with the propagation of waves, elasticity, fluid 


4 Friedrich Wilhelm Bessel (1784-1846) was a famous German astronomer and an intimate 
friend of Gauss, with whom he corresponded for many years. He was the first man to deter¬ 
mine accurately the distance of a fixed star: his parallax measurement of 1838 yielded a dis¬ 
tance for the star 61 Cygni of 11 light-years or about 360,000 times the diameter of the earth's 
orbit. In 1844 he discovered that Sirius, the brightest star in the sky, has a traveling compan¬ 
ion and is therefore what is now known as a binary star. This Companion of Sirius, with the 
size of a planet but the mass of a star, and consequently a density many thousands of times 
the density of water, is one of the most interesting objects in the universe. It was the first dead 
star to be discovered, and occupies a special place in modern theories of stellar evolution. 
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motion, and especially in many problems of potential theory and diffusion 
involving cylindrical symmetry They even occur in some interesting prob¬ 
lems of pure mathematics. We present a few applications in Appendix B, but 
first it is necessary to define the more important Bessel functions and obtain 
some of their simpler properties. 5 


The definition of the function J p (x). We begin our study of the solutions of 
(1) by noticing that after division by x 2 the coefficients of y' and y are P(x) = 
\/x and Q(x) = (x 2 - p 2 )/x 2 , so xP(x) = 1 and x 2 Q(x) = -p 2 + x 2 . The origin is 
therefore a regular singular point, the indicial equation 30-(5) is m 2 -p 2 - 0, 
and the exponents are »/, = p and m 2 = -p. It follows from Theorem 30-A that 
equation (1) has a solution of the form 

V = xP £ a n x n = £ a n x" + r, (2) 

where a 0 * 0 and the power series £ a n x n converges for all x. To find this solu¬ 
tion, we write 


y' = 5> + p)a n x n+r - 1 


and 


y” = Y,( n + p - 1 )(n + p)a n x n+ P~ 2 . 

These formulas enable us to express the terms on the left side of equation (1) 
in the form 


x 2 y" = Yfn + V _ l)( n + P) a n x " +r ' 

x y' = E( w + pK*" +p / 

xl y = xn+r , 

-p 2 y = YrP 2a n x " +v - 

If we add these series and equate to zero the coefficient of x n+ r, then after 
a little simplification we obtain the following recursion formula for the a n : 

n(2p + n)a„ + a n _ 2 = 0 (3) 


5 The entire subject is treated on a vast scale in G. N. Watson, A Treatise on the Theory of Bessel 
Functions, 2d ed., Cambridge University Press, London, 1944. This is a gargantuan work of 
752 pages, with a 36-page bibliography of 791 items. What we shall discuss amounts to lit¬ 
tle more than the froth on a heaving ocean of scientific effort extending over nearly three 
centuries. 
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or 


#)i-2 

n(2p + n) 


(4) 


We know that a 0 is nonzero and arbitrary. Since a , = 0, (4) tells us that = 0; 
and repeated application of (4) yields the fact that a„ = 0 for every odd sub¬ 
script n. The nonzero coefficients of our solution (2) are therefore 


a 0r a 2 — ~ 


$4 — — 


«0 


2(2p + 2) 

«2 


a o 


Uf, — — 


4(2p + 4) 2-4(2p + 2)(2p + 4) 

^4 Uq 


6(2p + 6) 2 • 4 • 6(2p + 2)(2p + 4)(2p + 6) 


and the solution itself is 


y = no* 


= a 0 x v 


x~ 


x 


2 (p + 1) 2 2!(p + l)(p + 2) 2 3!(p + l)(p + 2)(p + 3) 

x 2n 


Z (-l) n , 

V ' 2 2n n 

n =0 


!(P + l)-(P + «) 


(5) 


The Bessel function of the first kind of order p, denoted by J (x) is defined by put¬ 
ting a 0 = 1/2 p p\ in (5), so that 


Jp(%) — 


X? ^ x 2n 

■ A - X /_4 \n • /V 

2 2 "n\(p + l)---(p + n) 


y (x/2) 2n+p ' 
' n\{p + n)\ 


( 6 ) 


The most useful Bessel functions are those of order 0 and 1, which are 


/„(*) = £(-1)" 

n =0 


1 



= 1 - 




2 2 • 4 2 • 6 2 




(7) 
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FIGURE 59 

and 


n =0 


l 

n\(n + l)\ 



% i 1 fxY 

2~1!2!UJ + 2!3!UJ 


( 8 ) 


Their graphs are shown in Figure 59. These graphs display several interest¬ 
ing properties of the functions J 0 (x) and Jfx) each has a damped oscillatory 
behavior producing an infinite number of positive zeros; and these zeros 
occur alternately, in a manner suggesting the functions cos x and sin x. This 
loose analogy is strengthened by the relation ff x) = which we ask the 
reader to prove and apply in Problems 1 and 2. 

We hope the reader has noticed the following flaw in this discussion—that 
J p (x) as defined by (6) is meaningless unless the non-negative real number p is 
an integer, since only in this case has any meaning been assigned to the fac¬ 
tors (p + n)\ in the denominators. We next turn our attention to the problem 
of overcoming this difficulty. 

The gamma function. The purpose of this digression is to give a reasonable 
and useful meaning to p\ [and more generally to (p + u)\ for n = 0, 1, 2, ...] 
when the non-negative real number p is not an integer. We accomplish this 
by introducing the gamma function T(p), defined by 

co 

T(p) = dt, p> 0. (9) 

o 

The factor e~ l -*■ 0 so rapidly as t -*■ °° that this improper integral converges 
at the upper limit regardless of the value of p. However, at the lower limit we 
have er f ->• 1, and the factor P’- 1 -> oo whenever p < 1. The restriction that p must 
be positive is necessary in order to guarantee convergence at the lower limit. 

It is easy to see that 


r(p + l)=pT(p); 


( 10 ) 
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for integration by parts yields 


F (p +1) = lim \t v e l dt 

oo J 


= lim 

b-> oo 


-t p e 


im 

—>co J 


V 

( b 

lim 

b->< 

V 0 


b \ 

e~*dt 
o j 

\ 


= pr(p), 


since bi"/e h -* 0 as b -* If we use the fact that 


oo 

r(i) = jV'dt = i, (ii) 

o 

then (10) yields T(2) = ir(l) = 1, T(3) = 2r(2) = 2-1, T(4) - 3r(3) = 3 • 2 • 1, and in 
general 


r (n + 1) = n\ 


( 12 ) 


for any integer n > 0. 

We began our discussion of the gamma function under the assumption 
that p is non-negative, and we mentioned at the outset that the integral (9) 
does not exist if p = 0. However, we can define F(p) for many negative p’s 
without the aid of this integral if we write (10) in the form 


r (p) = 


r(p + i) 
v 


(13) 


This extension of the definition is necessary for the applications, and it begins 
as follows: If -1 < p < 0, then 0 < p + 1 < 1, so the right side of equation (13) has 
a value and the left side of (13) is defined to have the value given by the right 
side. The next step is to notice that if -2 < p < -1, then -1 < p + 1 < 0, so we can 
use (13) again to define T(p) on the interval -2 < p < -1 in terms of the values 
of T (p + 1) already defined in the previous step. It is clear that this process can 
be continued indefinitely. Furthermore, it is easy to see from (11) that 

limT(p) = lim + ^ = ±oo 

0 p-» 0 p 
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FIGURE 60 

according as p —> 0 from the right or left. The function F(p) behaves in a 
similar way near all negative integers, and therefore its graph has the gen¬ 
eral appearance shown in Figure 60. We will also need to know the curious 
fact that 


T 



= Vn. 


(14) 


This is indicated in the figure, and its proof is left to the reader (in Problem 3). 
Since T(p) never vanishes, the function 1 /T(p) will be defined and well behaved 
for all values of p if we agree that 1 /T(p) = 0 for p = 0, -1, -2, .... 

These ideas enable us to define p\ by 

p\ = r(p + 1) 

for all values of p except negative integers, and by formula (12) this func¬ 
tion has its usual meaning when p is a non-negative integer. Its reciprocal, 
1/p! = 1 /F(p + 1), is defined for all p's and has the value 0 whenever p is a 
negative integer. 
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The gamma function is an extremely interesting function in its own right. 
However, our purpose in introducing it here is solely to guarantee that the 
function J p (x) as defined by formula (6) has a meaning for every p > 0. We 
point out that even more has been achieved—since 1 /(p + n)\ now has a mean¬ 
ing for every p + n, (6) defines a perfectly respectable function of x for all 
values of p, without exception. 

The general solution of Bessel's equation. Our present position is this: we 
have found a particular solution of (1) corresponding to the exponent m, = p, 
namely, J f ,(x). In order to find the general solution, we must now construct a 
second independent solution—that is, one that is not a constant multiple of 
J (x). Any such solution is called a Bessel function of the second kind. The natural 
procedure is to try the other exponent, m 2 = -p. But in doing so, we expect to 
encounter difficulties whenever the difference m x - m 2 = 1 p is zero or a posi¬ 
tive integer, that is, whenever the non-negative constant p is an integer or half 
an odd integer. It turns out that the expected difficulties are serious only in 
the first case. 

We therefore begin by assuming that p is not an integer. In this case we 
replace p by —p in our previous treatment, and it is easy to see that the dis¬ 
cussion goes through almost without change. The only exception is that (3) 
becomes 


n(-2p + n)a„ + a n _ 2 = 0; 


and if it happens that p - 1/2, then by letting n - 1 we see that there is no 
compulsion to choose a 2 = 0. However, since all we want is a particular solu¬ 
tion, it is certainly permissible to put a 1 = 0 The same problem arises when 
p = 3/2 and n = 3, and so on; and we solve it by putting a x = a 3 = ■ ■ ■ = 0 in all 
cases. Everything else goes as before and we obtain a second solution 



(15) 


The first term of this series is 



so J (x) is unbounded near x - 0. Since J (x) is bounded near x - 0 these two 
solutions are independent and 


y = cJ F (x ) + cj_ p (x), p not an integer. 


(16) 


is the general solution of (1). 
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The solution is entirely different when p is an integer m > 0. Formula (15) 
now becomes 


/_,(*)=£(-ir 

n =0 


(x/2) 2 "-" 1 
n\(-m + n)\ 


-B- 1 * 


(x/2) 2n - n 
n\(-m + n)l 


since the factors \/(-m + n)\ are zero when n = 0,1 • • •, m -1. On replacing the 
dummy variable n by n + m and compensating by beginning the summation 
at n = 0, we obtain 


=£(-ir m 

n =0 


(x/2) 2(,l+, ” ) ~ , 

(n + m)\n\ 


=(-irX(- i ) n 

w=0 


(x/2) 2n+m 
n\(m + n)\ 


= (-l )"/»(*)■ 


This show that /_„,(*) is not independent of /,„(*) so in this case 

V = cj m {x) + cj_ m {x) 

is not the general solution of (1), and the search continues. 

At this point the story becomes rather complicated, and we sketch it very 
briefly. One possible approach is to use the method of Section 16 which is 
easily seen to yield 



dx 

Xj,n(x) 2 


as a second solution independent of J m (x). It is customary, however, to pro¬ 
ceed somewhat differently, as follows. When p is not an integer, any function 
of the form (16) with c 2 * 0 is a Bessel function of the second kind, including 
J_ p (x) itself. The standard Bessel function of the second kind is defined by 

/ P Mco 5 p,-;_, M (17) 

sin pn 

This seemingly eccentric choice is made for good reasons, which we describe 
in a moment. First, however, the reader should notice that (16) can certainly 
be written in the equivalent form 
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y = cj p (x) + c 2 Y p (x), p not an integer. 


(18) 


We still have the problem of what to do when p is an integer m, for (17) is 
meaningless in this case. It turns out after detailed analysis that the function 
defined by 


Y m (x) = lim Yp(x) 


(19) 


exists and is a Bessel function of the second kind; and it follows that 


y = cj p (x) + c 2 Y p (x) 


( 20 ) 


is the general solution of Bessel's equation in all cases, whether p is an integer 
or not. The graph of Y 0 (x) is shown by the dashed curve in Figure 59. This 
graph illustrates the important fact that for every p > 0, the function Y p (x) is 
unbounded near the origin. Accordingly, if we are interested only in solu¬ 
tions of Bessel's equation that are bounded near x = 0, and this is often the 
case in the applications, then we must take c 2 - 0 in (20). 

Now for the promised explanation of the surprising form of (17). We have 
pointed out that there are many ways of defining Bessel functions of the 
second kind. The definitions (17) and (19) are particularly convenient for two 
reasons. First, the form of (17) makes it fairly easy to show that the limit (19) 
exists (see Problem 9). And second, these definitions imply that the behavior 
of Y p (x), for large values of x, is matched in a natural way to the behavior of 
J p (x). To understand what is meant by this statement, we recall from Problem 
24-3 that introducing a new dependent variable u(x) = yfxy(x) transforms 
Bessel's equation (1) into 



( 21 ) 


When x is very large, equation (21) closely approximates the familiar differ¬ 
ential equation u" + u - 0, which has independent solutions ufx) = cos x and 
w 2 (x) = sin x. We therefore expect that for large values of x, any Bessel func¬ 
tion y(x) will behave like some linear combination of 


1 

—j=C osx 
yx 


and 



1 


smx. 


This expectation is supported by the fact that 
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and 


Y P (x) = 



pn\r 2 (x) 
2 J x 3/2 ' 


where rfx) and r 2 (x ) are bounded as x -> °°. 6 


Problems 

1. Use (7) and (8) to show that 

(a) -f Jo ( x) = -J l( x ); 
ax 

(b) ^~[xh(x)] = xj 0 (x). 
dx 

2. Use Problem 1 and Rolle's theorem to show that: 

(a) Between any two positive zeros of J 0 (x) there is a zero of J-fx). 

(b) Between any two positive zeros of Jfx) there is a zero of J 0 (x). 

3. According to the definition (9), 



(a) Show that the change of variable t - s * 1 2 3 leads to 



(b) Since s in (a) is a dummy variable, we can write 


P 




= 4j*j*e“ ( * 2+!/2) dx dy. 
o o 


6 See Watson, op. cit., chap. VII (footnote 5); or R. Courant and D. Hilbert, Methods of Mathematical 
Physics, vol. 1, pp. 331-334, 526, Interscience-Wiley, New York, 1953. 
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By changing this double integral to polar coordinates, show that 


7l/2a 


M = 4 j* je '~rdrdQ = n, 
oo 


so r[ - 1 = 4n. 


4. Since p\ = I '(p + 1) whenever p is not a negative integer, (14) says that 
^)’ = ^" a ^ cu ^ ate anc ^ More generally, show that 


n + - |! = 


(2n + l)! 

2 2,1+1 n! 


Vn 


and 


n — ! = 


(2m) ! 


2) 2 2 "n\ 




for any non-negative integer n. 

5. When p = 1/2, equation (21) shows that the general solution of Bessel's 
equation is expressible in either of the equivalent forms 

y = —j= (c\ cos x + C 2 sin x) 
fx 


and 


y = cj 1/2 (x) + cj_ 1/2 (x). 


It therefore must be true that 


and 


y[xjy 2 (x) = acosx + bsinx 


JxJ-m(x) = c cos x + d sin x 


for certain constants a, b, c, and d. By evaluating these constants, 
show that 



and 


7 - 1/2 (x) - 



cosx- 


Jin(x) - 
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6. Establish the formulas in Problem 5 by direct manipulation of the series 
expansions of J 1/2 and J_ 1/2 (x). 

7. Many differential equations are really Bessel's equation in disguised 
form, and are therefore solvable by means of Bessel functions. For 
example, let Bessel's equation be written as 


2 d w dw , 22 , n 

z — T + z — + (z- p l )zv = 0, 
dz dz 


and show that the change of variables defined by z = ax b and iv = yx c 
(where a, b, and c are constants) transforms it into 

x 2 ^4 + (2c + l)x d ^- + [a 2 b 2 x 2b + (c 2 - p 2 b 2 )]y = 0. 
dx dx 


Write the general solution of this equation in terms of Bessel functions. 

8. Use the result of Problem 7 to show that the general solution of Airy's 
equation y" + xy - 0 (see Problem 28-5) is 


y = x 


1/2 


2 3/2 I . t ( 2 3/2 


Cl/1/3 | — X j + C 2 /_i/ 3 l — X 


9. Apply l'Hospital's rule to the limit (19) to show that 


Vm{x) = - 
71 


±J p (x)-(-Y>"'±J_ p (x) 
op op 




47 Properties of Bessel Functions 

The Bessel function } (x) has been defined for any real number p by 


/ p (x)=^(-ir 

n =0 


(x/2) 2n+p 
n\(p + n)\ 


( 1 ) 


In this section we develop several properties of these functions that are use¬ 
ful in their applications. 
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Identities and the functions J m+ 1/2 (x). We begin by considering the formulas 


j-[x v J P {x)] = x ViW 


( 2 ) 


and 


d 

dx 


[x-rjAx)] = -x- p J P+ i(x). 


(3) 


To establish (2), we simply multiply the series (1) by xt' and differentiate: 


d_ 

dx 




(-l) n x 2n+2p 
l 2n+p n\(p + n)l 


y (-l) n x 2n+2p - 1 
la2 2 "+ p - 1 n\(p + n- 1 )! 


= x p Jj-iy 

n =0 


(x/2) 2n+p ~ 1 

«!(p-l + n)! 


= x p J P -i(x). 


The verification of (3) is similar, and we leave the details to the reader in 
Problem 1 below. If the differentiations in (2) and (3) are carried out, and the 
results are divided by x ±p , then the formulas become 


Jp(x) + £j P (x) = J p - 1 (x) 


(4) 


and 


J' p (x)-Pj p (x) = J p+1 (x). (5) 

x 

If (4) and (5) are first added and then subtracted, the results are 


2 Jp(x) Jp-i(x') Jp + i(x) 


( 6 ) 


and 


2p 


JpM Jp-i( x )+Jp+i( x )- 


X 


(7) 
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These formulas enable us to express Bessel functions and their derivatives in 
terms of other Bessel functions. 

An interesting application of (7) begins with the formulas 


hn(x) - 



and 


J- 1/2(2) - 



which were established in Problem 46-5. It now follows from (7) that 


J3/2M - —J1/2W-I-1/2W - J — 

7IX 


sin x 


- - COS X 


and 


/5/2(*) - — / 3 / 2 M _ Jl/2( X ) - 

X 


3 sin x 3 cos x 


- sm x 


Also, 


/- 3 / 2 W - J-l/2( x ) - Jl/2( x ) - 

x 


cosx 


— sinx 


and 


T /rl- 3 T M T [*~f 3 cosx 3sinx 

x \ nx\ x x J 

It is clear that calculations of this kind can be continued indefinitely, and 
therefore every Bessel function J m+1/2 ( x ) (where in is an integer) is elementary. 
It has been proved by Liouville that these are the only cases in which J (x) is 
elementary. 7 

Another application of formula (7) is given at the end of Appendix C, 
where we show how it yields Lambert's continued fraction for tan x. This 
continued fraction is of great historical interest, for it led to the first proof of 
the fact that it is not a rational number. 

When the differentiation formulas (2) and (3) are written in the form 

J*x p / p _i(x) dx = x p J p (x) + c (8) 


7 The details of this remarkable achievement can be found in Watson, op. cit., chap. IV, and in 
J. F. Ritt, Integration in Finite Terms, Columbia University Press, New York, 1948. The functions 
Jm+1/2W are often called spherical Bessel functions because they arise in solving the wave equa¬ 
tion in spherical coordinates. 
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and 



(9) 


then they serve for the integration of many simple expressions containing 
Bessel functions. For example, when p = 1, (8) yields 



( 10 ) 


In the case of more complicated integrals, where the exponent does not 
match the order of the Bessel function as it does in (8) and (9), integration by 
parts is usually necessary as a supplementary tool. 

Zeros and Bessel series. It follows from Problem 24-3 that for every value of 
p, the function J p (x) has an infinite number of positive zeros. This is true in 
particular of J 0 (x). The zeros of this function are known to a high degree of 
accuracy, and their values are given in many volumes of mathematical tables. 
The first five are approximately 2.4048, 5.5201, 8.6537, 11.7915, and 14.9309; 
their successive differences are 3.1153, 3.1336, 3.1378, and 3.1394. The corre¬ 
sponding positive zeros and differences for Jfx) are 3.8317, 7.0156, 10.1735, 
13.3237, and 16.4706; and 3.1839, 3.1579, 3.1502, and 3.1469. Notice how these 
differences confirm the guarantees given in Problem 25-1. 

What is the purpose of this concern with the zeros of J (x)? It is often neces¬ 
sary in mathematical physics to expand a given function in terms of Bessel 
functions, where the particular type of expansion depends on the problem 
at hand. The simplest and most useful expansions of this kind are series of 
the form 


oo 


f(x) = ^ a n J p (X n x) = fli/ p (A,ix) + a 2 J p (X 2 x) + ■■■, 


( 11 ) 


where/(x) is defined on the interval 0 < x < 1 and the /.„ are the positive zeros 
of some fixed Bessel function fix) with p > 0. We have chosen the interval 
0 < x < 1 only for the sake of simplicity, and all the formulas given below can 
be adapted by a simple change of variable to the case of a function defined on 
an interval of the form 0 <x<a. The role of such expansions in physical prob¬ 
lems is similar to that of Legendre series as illustrated in Appendix A, where 
the problem considered involves temperatures in a sphere. In Appendix B 
we demonstrate the use of (11) in solving the two-dimensional wave equa¬ 
tion for a vibrating circular membrane. 
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In the light of our previous experience with Legendre series, we expect the 
determination of the coefficients in (11) to depend on certain integral proper¬ 
ties of the functions J p (X n x). What we need here is the fact that 


0 


0 

\jp + i(K) 2 


if m ^ n, 
if m = n. 


( 12 ) 


In terms of the ideas introduced in Section 43, these formulas say that the 
functions J p (X n x) are orthogonal with respect to the weight function x on the 
interval 0 < x < 1. We shall prove them at the end of this section, but first we 
demonstrate their use. 

If an expansion of the form (11) is assumed to be possible, then multiplying 
through by xJ p (X m x), formally integrating term by term from 0 to 1, and using 
(12) yields 


J xf(x)J p (X m x) dx = -y J p+1 (K,f; 

0 

and on replacing m by n we obtain the following formula for a n : 

i 

a n = , pr '2 [ xf(x)J p (Kx) dx. (13) 

Jp+l(^n) » 

The series (11), with its coefficients calculated by (13), is called the Bessel 
series —or sometimes the Fourier-Bessel series —of the function/(x). As usual, 
we state without proof a rather deep theorem that gives conditions under 
which this series actually converges and has the sum/(x). 8 

Theorem A. (Bessel expansion theorem). Assume that f(x) and f(x) have at 
most a finite number of jump discontinuities on the interval 0 < x < 1. If 0 < x <1, 
then the Bessel series (11) converges to f(x) when x is a point of continuity of this 

\ 

function, and converges to — [f(x-) +/(*+)] when x is a point of discontinuity. 

It is natural to wonder what happens at the endpoints of the interval. At x = 1, 
the series converges to zero regardless of the nature of the function because 
every / P (X„) is zero. The series also converges at x = 0, to zero if p > 0 and to 
/(0+) if p = 0. 


For the proof, see Watson, op. cit., chap. XVIII. 
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As an illustration, we compute the Bessel series of the function/(x) = 1 for 
the interval 0 < x < 1 in terms of the functions J 0 (X n x), where it is understood 
that the X n are the positive zeros of J 0 (x). In this case, (13) is 

i 

a " = t ^ \2 [ x Jo(Kx) dx. 

Jl(A'n) J 


By (10), we see that 


1 


XjoCk n x) dx = 

2 

— x/i( X„x) 

J 

0 

_K n 


JiW 

. / 


so 


2 

an ~Kh(KY 


It follows that 


00 _ 

i = Y 2 


Jo(h n x) 


(0 < x < 1) 


is the desired Bessel series. 


Proofs of the orthogonality properties. To establish (12), we begin with the 
fact that y - J p (x) is a solution of 


y + ~y + 


r i\ 

1-P- 

2 

v * j 


y = fl¬ 


it a and b are distinct positive constants, it follows that the functions u(x) = 
J p (ax) and v(x) = f p (bx) satisfy the equations 


and 


u” + -u' + 
X 


f 2\ 
V * J 


u = 0 


v +—v + 

X 


„2 ^ 


v = 0 . 


(14) 


V 


y 


(15) 
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We now multiply these equations by u and u, the subtract the results, to 
obtain 


—(u'v-v'u) +—(u'v-v'u) = (b 2 -a 2 )uv; 
dx x 

and after multiplication by x, this becomes 

— [x(u'v-v'u)\ = (b 2 -a 2 )xuv. (16) 

dx 

When (16) is integrated from x = 0 to x = 1, we get 

i 

( b 2 - a 2 )jxuv dx = [x(u'v - v'u)]l. 
o 

The expression in brackets clearly vanishes at x = 0, and at the other end of 
the interval we have 2/(1) = J p (a) and z>(1) = f p (b). It therefore follows that the 
integral on the left is zero if a and b are distinct positive zeros and /,„ of 
J p (x); that is, we have 


| xJ p (X m x)J„(Kx) dx = 0, (17) 

o 

which is the first part of (12). 

Our final task is to evaluate the integral in (17) when m = n. If (14) is multi¬ 
plied by 2x 2 i;', it becomes 

2x 2 u'u" + 2xu' 2 + 2a 2 x 2 uu'-2p 2 mi = 0 
or 

—(xV 2 ) + — (a 2 x 2 u 2 )-2a 2 xu 2 (p 2 u 2 ) = 0, 

dx dx dx 


so on integrating from x = 0 to x = 1, we obtain 

i 

2a~ jxu 2 dx = [xhz' 2 + (fl 2 x 2 - p 2 )u 2 ]]. 
o 


( 18 ) 
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When x = 0, the expression in brackets vanishes; and since u'( 1) = affia), (18) 
yields 


i 

jxj p (ax) 2 dx = 
o 




„2 A 




Jp( a ) 2 - 


We now put a = /,„ and get 


j x] p (X n x) 2 dx = | J'fiK ) 2 = |/ p+ i(k„) 2 , 
o 


where the last step makes use of (5), and the proof of (12) is complete. 


Problems 

1. Verify formula (3). 

2. Prove that the positive zeros of J p (x) and J p+1 (x) occur alternately, in the 
sense that between each pair of consecutive positive zeros of either 
there is exactly one zero of the other. 

3. Express J 2 (x), J 3 (x), and J 4 (x) in terms of J 0 (x) and Jfx). 

4. If fix) is defined by 


1 


f(x) = < 


1 

2 

0 


0 < x < 


1 

2' 


x = 


1 

2' 


1 

2 


< x < 1, 


show that 


/(*)=£ 

n=l 


/i(A.„/2) 

Kh(K ) 2 


Jo(Kx), 


where the X n are the positive zeros of J 0 (x). 
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5. If f(x) = x 1 ’ for the interval 0 < x < 1, show that its Bessel series in the func¬ 
tions J p (X u x), where the X n are the positive zeros of J (x), is 


n =\ ^nJp+lQ^tt) 


J pfX n X). 


6. Use the notation of Problem 5 to show formally that if g(x) is a well- 
behaved function on the interval 0 < x < 1, then 

1 1 =0 1 

- [ x p+1 g(x ) dx = Y [ xg(x)J p (X„x) dx. 

2J 0 

By taking g(x) = x? and xp +2 , deduce that 

UiT4(p+i) and §^ = 16(p + l) 2 (p+2)- 

7. The positive zeros of sin x are it, 2%, 3it, ... Use the result of Problem 6 
(and Problem 46-5) to show that 


z 


n =1 


= 1 + 


l 

— + 
4 


(1 

9 


+ ••• 


6 


and 


V —= 1 — A ... = A 

A-?? 4 1 + 16 + 81 90' 

n =1 


8. Show that the change of dependent variable defined by 


By = 


1 du 
u dx 


transforms the special Riccati equation 

+ Bu 2 =Cx r 
dx 


into 

d ^-BCx m u = 0- 
dx 
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If m * -2, use Problem 46-7 to show that this equation is solvable in 
terms of elementary functions if and only if m = -4k/(2/c + 1) for some 
integer k. (When m = -2, the substitution y = n/x transforms Riccati's 
equation into an equation with separable variables that has an elemen¬ 
tary solution.) 

9. Show that the general solution of 


can be written as 


- 3/4 


y = x- 


1 2 


X \ + Cj 3/ 4 


cj- 


• 1/4 


( 1 


u 




Appendix A. Legendre Polynomials and Potential Theory 

If a number of particles of masses m v m 2 , ... m n/ attracting according to the 
inverse square law of gravitation, are placed at points P u P 2 , . .., P„, then the 
potential due to these particles at any point p (that is, the work done against 
their attractive forces in moving a unit mass from P to an infinite distance) is 


Gnii Gnu Gm n 

-i + - + ... + 

PPl PPl PPn 


(1) 


where G is the gravitational constant. 9 If the points P, P y P 2 , ..., P„ have rect¬ 
angular coordinates (x,y,z), {x v y v zf, (x 2 ,y 2 ,z 2 ), ■ ■ ■, (x„,y„,z, I ), so that 

ppi = >/(* _ Xi ) 2 +(y - yi) 2 +(z-zif, 


with similar expressions for the other distances, then it is easy to verify by 
partial differentiation that the potential U satisfies Laplace's equation: 


d 2 U d 2 U d 2 U 
dx 2 dy 2 dx 2 


(2) 


9 See equation 21-(17). 
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This partial differential equation does not involve either the particular 
masses or the coordinates of the points at which they are located, so it is sat¬ 
isfied by the potential produced in empty space by an arbitrary discrete or 
continuous distribution of particles. It is often written in the form 


V 2 U = 0, (3) 

where the symbol V 2 (del squared) is simply a concise notation for the dif¬ 
ferential operator 


d 2 d 2 d 2 
dx 2 dy 2 dz 2 

The function U is called a gravitational potential. If we work instead with 
charged particles of charges q v q 2 , ..., q n , then their electrostatic potential has 
the same form as (1) with the m's replaced by q's and G by Coulomb's con¬ 
stant, so it also satisfies Laplace's equation. This equation has such a wide 
variety of applications that its study is a branch of analysis in its own right, 
known as potential theory, the related equation 

» ,4) 

at 

called the heat equation, occurs in problems of heat conduction, where U 
is now a function of the time t as well as the space coordinates. The wave 
equation 


a 2 V 2 U 


d 2 U 

dt 2 


( 5 ) 


is connected with vibratory phenomena. 

We add a few brief comments on the physical meaning of equations (3) 
and (4). [Equation (5) is simply the three-dimensional counterpart of the 
one-dimensional wave equation 40-(8), which we have already discussed 
quite fully.] First, Laplace's equation (3) makes the same sort of statement 
about the function U as the one-dimensional equation d 2 y/dx 2 = 0 makes 
about a function y(x) of the single variable x. But the latter equation implies 
that y(x) has the linear form y = mx + b; and every such function has the 
property that its value at the center of an interval equals the average of 
its values at the endpoints. It is clear from (1) that solutions of Laplace's 
equation need not be linear functions of x, y, and z; in fact, they can be 
very complicated indeed. Nevertheless, it can be proved (and was discov¬ 
ered by Gauss) that any solution of (3) has the very remarkable property 
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that its value at the center of a sphere equals the average of its values on the 
surface of that sphere. 10 More generally, the function V 2 li can be thought 
of as a rough measure of the difference between the average value of U 
on the surface of a small sphere and its exact value at the center. Thus, for 
example, if U represents the temperature at an arbitrary point P in a solid 
body, and V 2 lt is positive at a certain point P 0 , then the value of U at P 0 is 
in general lower than its values at nearby points. We therefore expect heat 
to flow toward P 0 , raising the temperature there; and since the tempera¬ 
ture U is rising, dU/dt is positive at P 0 . This is essentially what the heat 
equation (4) says: that dU/dt is proportional to V 2 Lt and has the same sign. 
If the temperature U reaches a steady state throughout the body, so that 
dU/dt = 0 at all points, then V 2 lt = 0 and we are back to the case of Laplace's 
equation. 

We shall have occasion to use the formulas for V 2 Lt in cylindrical coordi¬ 
nates (r,0,z) and spherical coordinates (p,( ),<]>), as shown in Figure 61. These 
coordinates are related to rectangular coordinates by the equations 

x - r cos 0, y = r sin 0, z = z, 


and 


x - p sin 4> cos 0, y = p sin c|) sin 0, z = p cos (f>. 



FIGURE 61 


10 The two-dimensional version of this property is given in Problem 42-4. 
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By tedious but straightforward calculations one can show that in cylindrical 
coordinates. 


V 2 U 


d 2 U 1 dU 1 d 2 U d 2 U 
dr 2 r dr r 2 50 2 dz 2 ' 


( 6 ) 


and in spherical coordinates. 


V 2 U = ^ 

p- Sp 


,dU 
dp . 


f^d_ 

P 2 9p 


sin 


8U_ 

0(j) 


1 _ &U 

p 2 sin 2 4> 50 2 


(7) 


All students of mathematics or physics should carry out the necessary calcu¬ 
lations at least once in their lives, but perhaps once is enough! 


Steady-state temperatures in a sphere. Our purpose in this example is to 
illustrate as simply as possible the role of Legendre polynomials in solving 
certain boundary value problems of mathematical physics. 11 

Let a solid sphere of radius 1 be placed in a spherical coordinate system 
with its center at the origin. Let the surface be held at a specified temperature 
f(<$), which is assumed to be independent of 0 for the sake of simplicity, until 
the flow of heat produces a steady state for the temperature T(p, <{>) within the 
sphere. The problem is to find an explicit representation for the temperature 
function T(p, <|>). 

The steady-state temperature T satisfies Laplace's equation in spherical 
coordinates; and since T does not depend on 0, (7) allows us to write this 
equation in the form 


8 

dp 



81 ^ 
8P , 


1 8 
sin(|) d(|) 


f 

sin 

v 


dT 
3<t> j 


= 0. 


( 8 ) 


To solve (8) subject to the given boundary condition 

T(1,$)=W)’ (9) 

we use the method of separation of variables; that is, we seek a solution of (8) 
of the form T(p, <|>) = u(p)v(c|)). When this expression is inserted in (8) and the 
variables are separated, we obtain 


ld_ 
u dpy 


du 2 
dp , 


1 d ( . , dv 2 

-smip — 

csin(|) d(j) y di j> y 


( 10 ) 


11 Many problems of greater complexity are discussed in Lebedev, op. cit., chap. 8. 
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The crucial step in the method is the following observation: since the left 
side of equation (10) is a function of p alone and the right side is a function 
of 4> alone, each side must be constant. If this constant—called the separation 
constant —is denoted by X, then (10) splits into the two ordinary differential 
equations 

2 d 2 u du . n .... 

P tt + 2 P-, — Xu = 0 (11) 

dp dp 

and 


1 d 
sin 4> d (|> 


f 

sin 

v 


dv 


+ Xv = 0. 


( 12 ) 


Equation (11) is an Euler equation with p - 2 and q = -X, so its indicial equa¬ 
tion is 


m(m - 1) + 2m -X-0 or m 2 + m - X = 0. 


The exponents are therefore 
of (11) is 


— (—1 ± Vl + 4A, ), and the general solution 


u = Cip -V 2+ ^V5 +C2 p-V 2 -^ 


(13) 


or 


n = c 3 p~ 1/2 + c 4 p~ 1/2 log p. 

To guarantee that u is single-valued and bounded near p = 0, we discard the 

i I T 

second possibility altogether, and in (13) put c 2 = 0 and - — + JX + ^ = n where 
n is a non-negative integer. It follows that X = n(n + 1), so (13) reduces to 


u = Cjp" 


( 14 ) 


d 2 v cosd dv 

—t ;- 

sin 4) d(|) 


+ n(n + l)v = 0- 


and (12) becomes 
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If the independent variable is changed from c|) to x - cos cj), then this equation 
is transformed into 


(l-x 2 )‘^r-2x — + n(n + l)v = 0, (15) 

dx dx 

which is precisely Legendre's equation. By the physics of the problem, the 
function v must be bounded for 0 < <|> < it, or equivalently for -1 < x < 1; and we 
know from Section 44 that the only solutions of (15) with this property are 
constant multiples of the Legendre polynomials P n (x). If this result is com¬ 
bined with (14), then it follows that for each n = 0,1, 2, ..., we have particular 
solutions of (8) of the form 


a„p"P„(cos c|>), (16) 

where the a n are arbitrary constants. We cannot hope to satisfy the bound¬ 
ary condition (9) by using these solutions individually. However, Laplace's 
equation is linear and sums of solutions are also solutions, so it is natural to 
put the particular solutions (16) together into an infinite series and hope that 
T(p, (|>) can be expressed in the form 


T( p,4) = ^fl„p"P„(cos(K). (17) 

n =0 


The boundary condition (9) now requires that 


/w=y a „p„(cos^), 

n =0 


or equivalently that 


/(cos 1 x) - y^a n P n (x). (18) 

n =0 


We know from Section 45 that if the function/(cos 1 x) is sufficiently well 
behaved, then it can be expanded into a Legendre series of the form (18) 
where the coefficients a n are given by 


a„ = n + 


M J*/(cos 1 x)P„(x)dx. 


( 19 ) 


With these coefficients, (17) is the desired solution of our problem. 
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We have found the solution (17) by rather formal procedures, and it should 
be pointed out that there are difficult questions of pure mathematics involved 
here that we have not touched on at all. To a physicist, it may seem obvious 
that a solid body whose surface temperature is specified will actually attain 
a definite and unique steady-state temperature at every interior point, but 
mathematicians are unhappily aware that the obvious is often false. 12 The so- 
called Dirichlet problem of potential theory requires a rigorous proof of the 
existence and uniqueness of a potential function throughout a region that 
assumes given values on the boundary. This problem was solved in the early 
twentieth century by the great German mathematician Hilbert, for very gen¬ 
eral but precisely defined types of boundaries and boundary functions. 

The electrostatic dipole potential. The generating relation 


(1 - 2 xt + t 2 y 1/2 = y Pn (x)t n (20) 

n =0 


for the Legendre polynomials is discussed in Problems 44-1,44-2, and 44-3. 
As a direct physical illustration of its value, we use it to find the potential due 
to two point charges of equal magnitude q but opposite sign. If these charges 
are placed in a polar coordinate system (Figure 62), then with suitable units 
of measurement the potential at P is 

1 1 = 3 — 3 -, ( 21 ) 

h r 2 


p 



FIGURE 62 


12 Some fairly simple examples in which the statement just made is false are given in O. D. 
Kellogg, Foundations of Potential Theory, p. 285, Springer, New York, 1929. Einstein, a great 
maker of aphorisms, said: "The rarest and most valuable of all intellectual traits is the capac¬ 
ity to doubt the obvious." 
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where 

p = ^r 2 +a 2 -larcosQ and r 2 = ■yjr 2 + a 2 + 2or cos 9 
by the law of cosines. When r > a, we can use (20) to write 


1 _ 1 _ 1 _ 

h r iJl-2 cos 0(fl/r ) + (a/r) 2 



and similarly 


1 _ 1 _ 1 _ 

r 2 r ^1 + 2cos 0(fl/r) + (a/r) 2 

Formula (21) can now be written 


1 

r 


2>(-cos 0) 

n=0 



u = 


-J' [Pnicos 0)-P„(- COS 0)] 
r 


( 22 ) 


We know that the nth Legendre polynomial P n (x) is even if n is even and odd 
if n is odd. The bracketed expression therefore equals 0 or 2P,,(cos 0) accord¬ 
ing as n is even or odd, and (22) becomes 


Lt = ^X P -i(cos0)l 


a 
l r 


= 2i 

r 


P|(cOS 0)^ — j + -P 3 (cOS0)^ 


+ ••• 


(23) 


If we now assume that all terms except the first can be neglected when r is 
large compared with a, and recall that Pj(x) = x, then (23) yields 

U-2^} 


This is the approximation used by physicists for the dipole potential. 
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Appendix B. Bessel Functions and the Vibrating Membrane 

One of the simplest physical applications of Bessel functions occurs in Euler's 
theory of the vibrations of a circular membrane. In this context a membrane 
is understood to be a uniform thin sheet of flexible material pulled taut into 
a state of uniform tension and clamped along a given closed curve in the xy- 
plane. When this membrane is slightly displaced from its equilibrium posi¬ 
tion and then released, the restoring forces due to the deformation cause it to 
vibrate. Our problem is to analyze this vibrational motion. 

The equation of motion. Our discussion is similar to that given in Section 40 
for the vibrating string; that is, we make several simplifying assumptions 
that enable us to formulate a partial differential equation, and we hope that 
this equation describes the motion with a reasonable degree of accuracy. 
These assumptions can be summarized in a single statement: we consider 
only small oscillations of a freely vibrating membrane. The various ways in 
which this is used will appear as we proceed. 

First, we assume that the vibrations are so small that each point of the 
membrane moves only in the z direction, with displacement at time t given 
by some function z = z(x,y,f). We consider a small piece of the membrane 
(Figure 63) bounded by vertical planes through the following points in the 
xy-plane: ( x,y ), (x + A x,y), (x + kx,y + Ay), and x,y + Ay). If m is the constant 
mass per unit area, then the mass of this piece is m Ax Ay, and by Newton's 
second law of motion we see that 

d^z 

F = m Ax At/ — T (1) 

J dt 2 


is the force acting on it in the z direction. 



FIGURE 63 
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When the membrane is in its equilibrium position, the constant tension 
T has the following physical meaning: Along any line segment of length 
As, the material on one side exerts a force, normal to the segment and of 
magnitude T As, on the material on the other side. In this case the forces 
on opposite edges of our small piece are parallel to the xy-plane and cancel 
one another. When the membrane is curved, as in the frozen instant of 
motion shown in Figure 63, we assume that the deformation is so small 
that the tension is still T but now acts parallel to the tangent plane, and 
therefore has an appreciable vertical component. It is the curvature of our 
piece which produces different magnitudes for these vertical components 
on opposite edges, and this in turn is the source of the restoring forces that 
cause the motion. 

We analyze these forces by assuming that the piece of the membrane 
denoted by ABCD is only slightly tilted. This makes it possible to replace the 
sines of certain small angles by their tangents, as follows. Along the edges 
DC and AB, the forces are perpendicular to the x-axis and almost parallel to 
the y-axis, with small z components approximately equal to 


T Ax 


f a \ 
dz 

V dy Vy+Ay 


and -T Ax 


f a \ 
dz 


dy 


V n 


so their sum is approximately 


TAx 


f dz} 
j 


r ^ 
dz 

y+Ay V J y 


The subscripts on these partial derivatives indicate their values at the points 
(x,y + Ay) and (x,y). By working in the same way on the edges BC and AD, 
we find that the total force in the z direction (neglecting all external forces) 
is approximately 


F = TAy 




+ T Ax 


f a \ 

dz 

\ /y+Ay 


r dz' 

My 


so (1) can be written 


j ( dz/dx) x+Ax - (dz/dx) x | T (dz/gy) v+A y - ( dz/dy) y ^ d 2 z 

Ay dt 2 


Ax 
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If we now put a 2 = T/m and let Ax -> 0 and Ay -> 0, this becomes 


Vz d 2 z" 

y dx 2 + dy\ 


d 2 z 

dt 2 ' 


( 2 ) 


which is the two-dimensional wave equation. 

Students may be somewhat skeptical about the argument leading to equa¬ 
tion (2). If so, they have plenty of company; for the question of what consti¬ 
tutes a satisfactory derivation of the differential equation describing a given 
physical system is never easy, and is particularly baffling in the case of the 
wave equation. To give a more refined treatment of the limits involved would 
get us nowhere, since the membrane is ultimately atomic and not continu¬ 
ous at all. Perhaps the most reasonable attitude is to accept our discussion as 
a plausibility argument that suggests the wave equation as a mathematical 
model. We can then adopt this equation as an axiom of rational mechanics 
describing an "ideal membrane" whose mathematical behavior may or may 
not match the actual behavior of real membranes. 13 


The circular membrane. We now specialize to the case of a circular mem¬ 
brane, in which it is natural to use polar coordinates with the origin located 
at the center. Formula (6) of Appendix A shows that in this case the wave 
equation (2) takes the form 


2 f d 2 z 1 dz d 2 z ^ 

Cl - 7T H-1- 7T 

^ dr r dr d0 2 y 

where z = z(r,0,f) is a function of the polar coordinates and the time. For 
convenience we assume that the membrane has radius 1, and is therefore 
clamped to its plane of equilibrium along the circle r - 1 Accordingly, our 
boundary condition is 



z(l6,r) - 0. (4) 

The problem is to find a solution of (3) that satisfies this boundary condition 
and certain initial conditions to be specified later. 

In applying the standard method of separation of variables, we begin with 
a search for particular solutions of the form 

z(r,0,t) = u(r)v(&)w(t). (5) 


13 On the question, "What is rational mechanics?," we recommend the illuminating remarks of 
C. Truesdell, Essays in the History of Mechanics, pp. 334-340, Springer, New York, 1968. 
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When (5) is inserted in (3) and the result is rearranged, we get 

u"(r) | 1 u\r) [ 1 v"(Q) _ 1 w"(t) 
u{r) r u(r) r 2 v(fd) a 2 zv(t) 


Since the left side of equation (6) is a function only of r and 0, and the right 
side is a function only of t, both sides must equal a constant. For the mem¬ 
brane to vibrate, w(t) must be periodic; and the right side of (6) shows that in 
order to guarantee this, the separation constant must be negative. We there¬ 
fore equate each side of (6) to -X 2 with X > 0, and obtain the two equations 

w"(f) + X 2 a 2 w(f) = 0 

and 

u"(r) [ 1 u\r) | 1 P "(9) _ ? 2 
u(r) r u(r) r 2 17(0) 


( 7 ) 

( 8 ) 


It is clear that (7) has 

w(t) = c 1 cos Xat + c 2 sin Xat 
as its general solution, and (8) can be written as 

2 U’(r) u'(r) . 2 2 u"(9) 

r —— + r —— + X r =-—. 

u(r) u(r) 17(0) 


( 9 ) 


( 10 ) 


In (10) we have a function of r on the left and a function of 0 on the right, so 
again both sides must equal a constant. We now recall that the polar angle 
0 of a point in the plane is determined only up to an integral multiple of 
2k; and by the nature of our problem, the value of o at any point must be 
independent of the value of 0 used to describe that point. This requires that 
o must be either a constant or else nonconstant and periodic with period 2k. 
An inspection of the right side of equation (10) shows that these possibilities 
are covered by writing the separation constant in the form n 2 where n = 0,1, 
2, ..., and then (10) splits into 


o"(0) + n 2 v(d) - 0 


( 11 ) 


and 


r 2 u"(r) + ru'(r) + ( X 2 r 2 - n 2 )u(r) = 0 . 


( 12 ) 
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By recalling that o is either a constant or else nonconstant and periodic with 
period 2it, we see that (11) implies that 

d(0) = d 1 cos nQ + d 2 sin nQ (13) 

for each n, regardless of the fact that (13) is not the general solution of (11) 
when n = 0. Next, it is clear from Problem 46-7 that (12) is a slightly disguised 
form of Bessel's equation of order n, with a bounded solution J n (Xr) and an 
independent unbounded solution Y n (Xr). Since n(r) is necessarily bounded 
near r - 0, we discard the second solution and write 


W) = kJ n (Xr). (14) 

The boundary condition (4) can now be satisfied by requiring that u( 1) = 0 or 

m= o- (is) 

Thus the permissible values of X are the positive zeros of the function /„(*), 
and we know from Section 47 that J n (x) has an infinite number of such zeros. 
We therefore conclude that the particular solutions (5) yielded by this analy¬ 
sis are constant multiples of the doubly infinite array of functions 

J n (Xr)(d : cos nQ + d 2 sin nQ)(c 1 cos Xat + c 2 sin Xat), (16) 

where n = 0, 1, 2, ..and for each n the corresponding X's are the positive 
roots of (15). 

Special initial conditions. The above discussion is intended to show how 
Bessel functions of integral order arise in physical problems. It also demon¬ 
strates the significance of the positive zeros of these functions. For the sake 
of simplicity, we confine our further treatment to the following special case: 
the membrane is displaced into a shape z = f(r) independent of the variable 
0, and then released from rest at the instant t - 0 This means that we impose 
the initial conditions 


and 


z(r,0,O) =f(r) 


dz 
^ t=o 


= 0 . 


(17) 


(18) 


The problem is to determine the shape z(r,Q,i) at any subsequent time t > 0. 
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Our strategy is to adapt the particular solutions already found to the given 
initial conditions. First, the part of (17) that says that the initial shape is inde¬ 
pendent of 0 implies that o(0) is constant, so (13) tells us that n = 0. If the posi¬ 
tive zeros of J 0 (x) are denoted by /.„ a 2 ..., a,„ ..., then this remark reduces the 
array of functions (16) to 

Jo(K r )( c i cos X n at + c 2 sin X n at), n = 1,2, ... 

Next, (18) implies that c 2 = 0, and this leaves us with constant multiples of the 
functions 


J 0 {\„r) cos X n at,n = 1,2,.... 

Up to this point we have not used the fact that sums of solutions of (3) are 
also solutions. Accordingly, the most general formal solutions now available 
to us are the infinite series 


z = y]a„J 0 (X n r)cosX„at. (19) 

n=l 


Our final step is to try to satisfy (17) by putting t - 0 in (19) and equating the 
result to f(r): 


f(r) = ^a n J 0 (X n r). 

n =1 


The Bessel expansion theorem of Section 47 guarantees that this representa¬ 
tion is valid whenever/(r) is sufficiently well behaved, if the coefficients are 
defined by 


2 2 f rf(r)J 0 (h n r) dr. 

h\X,i) J Q 

With these coefficients, (19) is a formal solution of (3) that satisfies the 
given boundary condition and initial conditions, and this concludes our 
discussion. 14 


14 Many additional applications of Bessel functions can be found in Lebedev, op. cit., chap. 6. 
See also A. Gray and G. B. Mathews, A Treatise on Bessel Functions and Their Applications to 
Physics, Macmillan, New York, 1952. 
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Appendix C. Additional Properties of Bessel Functions 

In Sections 46 and 47 we had no space for several remarkable properties of 
Bessel functions that should not go unmentioned, so we present them here. 
Unfortunately, a full justification of our procedures requires several theo¬ 
rems from more advanced parts of analysis, but this does not detract from 
the validity of the results themselves. 

The generating function. The Bessel functions j n (x) of integral order are 
linked together by the fact that 

oo 

e (*/w-vt) = /o(x)+ Yj n (xw +(-i rn. (i) 

n =1 

Since J_ n (x) = (-1 ) n J n (x), this is often written in the form 

00 

e (x/2) ( t-i/f) = ^J n {x)t n . (2) 

n=— oo 

To establish (1), we formally multiply the two series 



The result is a so-called double series, whose terms are all possible products 
of a term from the first series and a term from the second. The fact that each 
of the series (3) is absolutely convergent permits us to conclude that this dou¬ 
ble series converges to the proper sum regardless of the order of its terms. 
For each fixed integer n > 0, we obtain a term of the double series containing 
t n precisely when j = n + k; and when all possible values of k are accounted 
for, the total coefficient of t n is 

V_ 1 _— (”!)* x * ■*- V( l) k ix/ 2 ) 2 k+n -J(:x) 

Zj(n + k)l2 n+k k\ 2 k ; *!(« + *)! hA ' 

Similarly, a term containing t~ n (n > 1) arises precisely when k = n + j, so the 
total coefficient of tr n is 

y I (yir xfff = y (x/2f’ +n 
Lfj\ 2> (n + j)\ 2 n+i j\(n + j)\ 

= (-i)7„(*). 


and the proof of (1) is complete. 
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A simple consequence of (2) is the addition formula 

oo 

Jn(x + y) — n-k(x)J fc(j/) • 

k=- oo 

To prove this, we notice first that 

oo 

g(*/2)(t-i/t)g(y/2)(t-i/t) _ g[(*+y)/2](f-i/t) = yj n(x + y)f «. 

«=—oo 

However, the product of the two exponentials on the left is also 


(4) 


2>)h 


^Jk(y)t k =]T Jn- k {x)J k {y) 

k =-oo n=-oo k=-co 


t n 
*- / 


and (4) follows at once on equating the coefficients of t n in these expressions. 
When n = 0, (4) can be written as 


Jo(x + y)= ^J-k(x) 

k =—oo 

00 00 

= Jo(x)J 0 (y)+ 1/ - k (x)Jk(y) + '2j k (x)J- k (iy) 

k =1 k=1 

00 

= Jo(x)J 0 (y) + '£ j (-V k [Jk(x)J k (y) + 7t(*)/t(y)] 

k =1 
00 

= /o(x)/o(y) + ^(-lf2/,(x)/,(y) 


or 


7o(* + y) = JoWJo(y) - 2/ 1 (x)/ 1 (y) + 2 ] 2 {x)] 2 {y) - ■ ■, (5) 

If we replace y by —x and use the fact that ff x) is even or odd according as n 
is even or odd, then (5) yields the remarkable identity 

1 = Jo( x ) 2 + 2Jfx) 2 + 2/ 2 (x) 2 + ■ ■ •, (6) 


which shows that |/ 0 (x)| < 1 and J„(x)|< l/V2 for n = 1, 2,.... 
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Bessel's integral formula. When t = e'°, the exponent on the left side of (2) 
becomes 


x 



ixsinO/ 


and (2) itself assumes the form 


^ixsinG_ 


2>y" e 


( 7 ) 


Since e“ sinB = cos (x sin 0) + i sin (x sin 0) and e'" 0 = cos n0 + i sin 770, equating 
real and imaginary parts in (7) yields 


cos(xsin0) = ^/„(x)cos77 0 


( 8 ) 


and 


sin (x sin 


oo 

0 ) =£/»(*) 

n =—oo 


sinn0. 


( 9 ) 


If we now use the relations J_„(x) = (-1 ) n J n (x), cos (-nQ) = cos nQ, and sin (—770) = 
-sin 770, then (8) and (9) become 


cos (xsin0) = / o (x) + 2^/ 2 „(x)cos2f70 (10) 

n= 1 

and 


sin (x sin 0) = 2^/ 2n -i(x)sin(277 -1)0. 


( 11 ) 


As a special case of (10), we note that 0 = 0 yields the interesting series 


1 - /„(*) + 2/ 2 (x) + 2 J 4 (x) + ■ ■ 
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Also, on putting 0 = it/2 in (10) and (11), we obtain the formulas 


cos x = J 0 (x) - 2/ 2 (x) + 2 J 4 (x) - 


and 


sin x = 2 Jfx) - 2/ 3 (x) + 2 / 5 (x) 


which demonstrate once again the close ties between the Bessel functions 
and the trigonometric functions. 

The most important application of (8) and (9) is to the proof of Bessel's inte¬ 
gral formula 



cos (n 0 - x sin 0) d0. 


o 


( 12 ) 


To establish this, we multiply (8) by cos mQ , (9) by sin mB, and add: 


cos(m0-xsin0) = ^/„(x)cos(m-w)0. 


When both sides of this are integrated from 0 = 0 to 0 = ji, the right side 
reduces to ji/,„(x), and replacing m by n yields formula (12). In his astronomi¬ 
cal work, Bessel encountered the functions /„(x) in the form of these integrals, 
and on this basis developed many of their properties. 15 


Some continued fractions. If we write the identity 47-(7) in the form 

Jp-i(x) ~ J P (x) — 7p+i(x), 

X 

then dividing by J p (x) yields 

_ 2p _ 1 _ 

Jp(x) x Jp(x)/J p+1 (x) 

When this formula is itself applied to the second denominator on the right, 
with p replaced by p + 1, and this process is continued indefinitely, we 
obtain 


15 For a description of Bessel's original problem, see Gray and Mathews, op. cit., pp. 4-7. 
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Jp-i(s) _ 2p _1_ 

J p (x ) x 2p + 2 1 

x 2p + 4 
x 

This is an infinite continued fraction expansion of the ratio J p i(x)// ( , (x). We 
cannot investigate the theory of such expansions here. Nevertheless, it may 
be of interest to point out that when p = 1/2, it follows from Problem 46-5 that 
J-i/ 2 (x)/J 1/2 (x) = cot x, so 


tanx = j-. 

x'3 1 

x 5_ 
x 

This continued fraction was discovered in 1761 by Lambert, who used it to 
prove that k is irrational. He reasoned as follows: If x is a nonzero rational 
number, then the form of this continued fraction implies that tan x cannot be 
rational; but tan it/4 = 1, so neither jt/4 nor it is rational. Several minor flaws 
in Lambert's argument were patched up by Legendre about 30 years later. 
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Chapter 9 

Laplace Transforms 


48 Introduction 

In recent years there has been a considerable growth of interest in the use of 
Laplace transforms as an efficient method for solving certain types of dif¬ 
ferential and integral equations. In addition to such applications, Laplace 
transforms also have a number of close connections with important parts 
of pure mathematics. We shall try to give the reader an adequate idea of 
some of these matters without dwelling too much on the analytic fine points 
and computational techniques that would be appropriate in a more extensive 
treatment. 

Before entering into the details, we offer a few general remarks aimed at 
placing the ideas of this chapter in their proper context. We begin by noting 
that the operation of differentiation transforms a function/(x) into another 
function, its derivative/'(x). If the letter D is used to denote differentiation, 
then this transformation can be written 

DIM =/'(*)• ( 1 ) 

Another important transformation of functions is that of integration: 


I[f(x)] = \f(t)dt. (2) 

o 

An even simpler transformation is the operation of multiplying all functions 
by a specific function g(x): 


M g \f(x)]=g(x)f(x). (3) 

The basic feature these examples have in common is that each transforma¬ 
tion operates on functions to produce other functions. It is clear that in most 
cases some restriction must be placed on the functions/(x) to which a given 
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transformation is applied. Thus, in (1) f(x) must be differentiable, and in (2) 
it must be integrable. In each of our examples, the function on the right is 
called the transform of f(x) under the corresponding transformation. 

A general transformation T of functions is said to be linear if the relation 

T[af(x) + Py(x)] = aT\f(x)] + pT[g(x)] (4) 

holds for all admissible functions f(x) and g(x) and all constants a and p. 
Verbally, equation (4) says that the transform of any linear combination of 
two functions is the same linear combination of their transforms. It is worth 
observing that (4) reduces to 

T[f(x)+g(x)] = T[f(x)] + T[g(x)] 


and 


T[af(x)] = aT\f(x)] 

when a = p = 1 and when p = 0. It is easy to see that the transformations 
defined by (1), (2), and (3) are all linear. 

A class of linear transformations of particular importance is that of the 
integral transformations. To get an idea of what these are, we consider func¬ 
tions/^) defined on a finite or infinite interval a<x<b, and we choose a fixed 
function K(p,x) of the variable x and a parameter p. Then the general integral 
transformation is given by 

b 

T[f(x)] = jK(p,x)f(x)dx = F(p). (5) 

a 

The function K(p,x) is called the kernel of the transformation T, and it is clear that 
T is linear regardless of the nature of K. The concept of a linear integral trans¬ 
formation, in generalized form, has been the source of some of the most fruitful 
ideas in modern analysis. Also, in classical analysis, various special cases of (5) 
have been minutely studied, and have led to specific transformations useful in 
handling particular types of problems. 

When a - 0, b = and K(p,x) = e~ px , we obtain the special case of (5) that 
concerns us—the Laplace transformation L, defined by 

oo 

L[f(x)] = ^J(x)dx = F(p). (6) 

o 

Thus, the Laplace transformation L acts on any function/(x) for which this 
integral exists, and produces its Laplace transform L\f(x)] = F(p), a function of 
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the parameter p. 1 We remind the reader that the improper integral in (6) is 
defined to be the following limit, and exists only when this limit exists: 


b 


00 



( 7 ) 


o 


0 


When the limit on the right exists, the improper integral on the left is said 

to converge. 

The following Laplace transforms are quite easy to compute: 



( 8 ) 


0 


00 



( 9 ) 


0 


oo 



( 10 ) 


0 


00 



( 11 ) 


00 



( 12 ) 


0 


00 



f(x) = cos ax. 


(13) 


The integral in (11) converges for p > a, and all the others converge for p> 0. 
Students should perform the necessary calculations themselves, so that the 
source of these restrictions on p is perfectly clear (see Problem 1). As an illus¬ 
tration, we provide the details for (10), in which n is assumed to be a positive 
integer: 


1 As this remark suggests, we shall consistently use small letters to denote functions of x and 
the corresponding capital letters to denote the transforms of these functions. 
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L[x"]=J, 


= \e px x n dx =- : 


= -L[x n ~ 1 ] = - 


n * 


f , \ 
n -1 


00 

+ -[ e - px X n - 1 dx 

pi 

Jo 0 

L[x n 


V r 
n\ 

.n +1 


It will be noted that we have made essential use here of the fact that 

x n 

lim-— = 0 forp>0. 

The above formulas will be found in Table 1 in Section 50. Additional simple 
transforms can readily be determined without integration by using the lin¬ 
earity of L, as in 


L[2x + 3] = 2L[x] + 3L[1] = \ + -. 

V V 

In later sections we shall develop methods for finding Laplace transforms of 
more complicated functions. 

As we stated above, the Laplace transformation L can be regarded as the 
special case of the general integral transformation (5) obtained by taking 
a = 0,b = and K(p,x) = e~P x . Why do we choose these limits and this particu¬ 
lar kernel? In order to see why this might be a fruitful choice, it is useful to 
consider a suggestive analogy with power series. 

If we write a power series in the form 


y>)x-, 

17=0 


then its natural analog is the improper integral 


J a(t)x‘dt. 

o 

We now change the notation slightly by writing x = e~ p , and this integral 
becomes 
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jV pf «(f)df, 

o 

which is precisely the Laplace transform of the function a(t). Laplace trans¬ 
forms are therefore the continuous analogs of power series; and since power 
series are important in analysis, we have reasonable grounds for expecting 
that Laplace transforms will also be important. 

A short account of Laplace is given in Appendix A. 


Problems 


1. Evaluate the integrals in (8), (9), (11), (12), and (13). 

2. Without integrating, show that 

(a) L[sinh ax\ = . a , , p > \a \; 

p -a 

V i i 

(b) L[coshflx] = ^i— j, p> \a\. 

p -a 

3. Find L[sin 2 ax] and L[cos 2 ax] without integrating. How are these two 
transforms related to one another? 

4. Use the formulas given in the text to find the transform of each of the 
following functions: 

(a) 10; 

(b) x 5 + cos 2x; 

(c) 2e 3x - sin 5x; 

(d) 4 sin x cos x + 2e~ x ; 

(e) x 6 sin 2 3x + x 6 cos 2 3x. 

5. Find a function/(x) whose transform is 



(b) 

(c) 

(d) 

(e) 


2 

p + 3' 

4 6 

3 2 A ' 

p p +4 
1 

2 ' 
p +p 
1 


3 4 + D 2 ’ 


6. Give a reasonable definition of —!. 

2 
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49 A Few Remarks on the Theory 

Before proceeding to the applications, it is desirable to consider more carefully the 
circumstances under which a function has a Laplace transform. A detailed and 
rigorous treatment of this problem would require familiarity with the general 
theory of improper integrals, which we do not assume. On the other hand, it is 
customary to give a brief introduction to this subject in elementary calculus, and 
a grasp of the following simple statements will suffice for our purposes. 

First, the integral 


jf(x)dx (1) 

0 


is said to converge if the limit 


b 



exists, and in this case the value of (1) is by definition the value of this limit: 


oo b 



Next, (1) converges whenever the integral 

oo 

J* \f(x)\ dx 

o 


converges, and in this case (1) is said to converge absolutely. And finally, 
(1) converges absolutely—and therefore converges—if there exists a function 
g(x) such that \f(x) \ < g(x) and 


jg(x)dx 

o 

converges (this is known as the comparison test). 

Accordingly, i f fix) is a given function defined for x > 0, the convergence 
of (1) requires first of all that the integral \lf(x)dx must exist for each finite 
b > 0. To guarantee this, it suffices to assume that fix) is continuous, or at 
least is piecewise continuous. By the latter we mean that f(x) is continuous over 
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FIGURE 64 

every finite interval 0 < x <b, except possibly at a finite number of points 
where there are jump discontinuities, at which the function approaches dif¬ 
ferent limits from the left and right. Figure 64 illustrates the appearance of 
a typical piecewise continuous function; its integral from 0 to b is the sum 
of the integrals of its continuous parts over the corresponding subintervals. 
This class of functions contains virtually all that are likely to arise in prac¬ 
tice. In particular, it includes the discontinuous step functions and sawtooth 
functions expressing the sudden application or removal of forces and volt¬ 
ages in problems of physics and engineering. 

If f(x) is piecewise continuous for x > 0, then the only remaining threat to 
the existence of its Laplace transform 

oo 

F(p) = je~ px f(x) dx 
o 

is the behavior of the integrand t^ x if(x) for large x. In order to make sure that 
this integrand diminishes rapidly enough for convergence—or that f(x) does 
not grow too rapidly—we shall further assume that f(x) is of exponential order. 
This means that there exist constants M and c such that 

\fix)\ < Me cx . (2) 

Thus, although/(x) may become infinitely large as x -* °°, it must grow less 
rapidly than a multiple of some exponential function e cx . It is clear that any 
bounded function is of exponential order with c = 0. As further examples, 
we mention e ax (with c- a) and x" (with c any positive number). On the other 
hand, e x is not of exponential order. \f fix) satisfies (2), then we have 

\e-v x f(x)\ <Mr^; 
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and since the integral of the function on the right converges for p > c, the 
Laplace transform of fix) converges absolutely for p > c. In addition, we note 
that 


so 


\m\ 


je px f(x)dx 
o 


< J|e px f(x)\dx 
o 


00 



F(p) —>• 0 as p -> °°. (3) 

Actually, it can be shown that (3) is true whenever F(p) exists, regardless of 
whether or not f(x) is piecewise continuous and of exponential order. Thus, if 
<|i(p) is a function of p with the property that its limit as p does not exist 
or is not equal to zero, then it cannot be the Laplace transform of any fix). 
In particular, polynomials in p, sin p, cos p, e p , and log p cannot be Laplace 
transforms. On the other hand, a rational function is a Laplace transform if 
the degree of the numerator is less than that of the denominator. 

The above remarks show that any piecewise continuous function of expo¬ 
nential order has a Laplace transform, so these conditions are sufficient for the 
existence of L[f(x)\. However, they are not necessary, as the exa mple fix) = x 1 /2 
shows. This function has an infinite discontinuity at x = 0, so it is not piece- 
wise continuous, but nevertheless its integral from 0 to b exists; and since it 
is bounded for large x, its Laplace transform exists. Indeed, for p > 0 we have 


L[x 1/2 ] = j*e rx x 1/2 dx, 
o 

and the change of variable px-t gives 


L[x- 1/2 ] = p- 1/2 jV f r 1/2 rft. 

0 

Another change of variable, t = s 2 , leads to 


L[x- 1/2 ] = 2p- 1/2 Je- s2 ds. 
0 


( 4 ) 
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In most treatments of elementary calculus it is shown that the last-written 
integral has the value Vtc/ 2 (see Problem 1), so we have 


L[x" 1/2 ] = 



( 5 ) 


This result will be useful in a later section. 

In the remainder of this chapter we shall concentrate on the uses of Laplace 
transforms, and will not attempt to study the purely mathematical theory 
behind our procedures. Naturally these procedures need justification, and 
readers who are impatient with formalism can find what they want in more 
extensive discussions of the subject. 


Problems 

1. If/denotes the integral in (4), then (s being a dummy variable) we can write 



~ d V = 


00 00 

j*jV (T +v ] dxdy. 

o o 


Evaluate this double integral by changing to polar coordinates, and 
thereby show that I = -Jn/l. 

2. In each of the following cases, graph the function and find its Laplace 
transform: 

(a) fix) = n(x - a) where a is a positive number and u{x) is the unit step 
function defined by 


u(x) = 


if x < 0 
if x > 0; 


(b) 

(c) 

(d) 


fix) = [x] where [x] denotes the greatest integer < x; 
fix) = x - [x]; 

f sin x if 0 < x < n 
10 if x > 7i. 


/(*) = 


2 

3. Show explicitly that L[e x ] does not exist. Hint: x 2 -px-(x- p/2) 2 - p 2 / 4. 

4. Show explicitly that L[x -1 ] does not exist. 
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5. Let e be a positive number and consider the function ffx) defined by 


m= 



if 0 < x < e 
if x > e. 


The graph of this function is shown in Figure 65. It is clear that for 
every e > 0 we have 1^ f e (x) dx = 1. Show that 

1 - 

Uf(x)] = 

pe 


and 


lim L[f(x)] = 1. 


Strictly speaking, lim, f ( x ) does not exist as a function, so 
L[lim, ^ 0 f (x)] is not defined; but if we throw caution to the winds, then 

8(x) = lim/ E (x) 


y 


1/e 


FIGURE 65 








Laplace Transforms 


457 


is seen to be some kind of quasi-function that is infinite at x - 0 and 
zero for x > 0, and has the properties 


J5(x)dx = l and L[S(x)] = l. 

o 

This quasi-function 5(x) is called the Dirac delta function or unit impulse 
function. 2 


50 Applications to Differential Equations 

Suppose we wish to find the particular solution of the differential equation 

y" + fly' + by =/(x) (1) 

that satisfies the initial conditions y(0) = y 0 and y'(0) = y' 0 . It is clear that we 
could try to apply the methods of Chapter 3 to find the general solution and 
then evaluate the arbitrary constants in accordance with the given initial 
conditions. However, the use of Laplace transforms provides an alternate 
way of attacking this problem that has several advantages. 

To see how this method works, let us apply the Laplace transformation L 
to both sides of (1): 


L[y" + fly' + by] = L[f(x)\. 

By the linearity of L, this can be written as 

L[y"\ + aL[y’] + bL[y\ = L[f(x)\. (2) 

Our next step is to express L[t/'] and L[y"] in terms of L[y\. First, an integration 
by parts gives 


2 P.A.M. Dirac (1902-1984) was an English theoretical physicist who won the Nobel Prize at the 
age of thirty-one for his work in quantum theory. There are several ways of making good 
mathematical sense out of his delta function. See, for example, I. Halperin, Introduction to the 
Theory of Distributions, University of Toronto Press, Toronto, 1952; or A. Erdelyi, Operational 
Calculus and Generalized Functions, Holt, New York, 1962. Dirac's own discussion of his func¬ 
tion is interesting and easy to read; see pp. 58-61 of his treatise The Principles of Quantum 
Mechanics, Oxford University Press, 4th ed., 1958. 
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oo 


L[y’] = y^y’dx 


0 


00 



0 


= -ij(0) + pL[y], 


so 


L[y'] = pL[y\-y( 0). 


( 3 ) 


Next, 


%"] = my') 1 ] = pJ-ly '] - y'( 0), 


so 


L[y"\=p 2 L[y\-py( O)-y'(O). 


( 4 ) 


If we now insert the given initial conditions in (3) and (4), and substitute 
these expressions in (2), we obtain an algebraic equation for L[y], 


p 2 L[y ]- pyo -y'o+ apL[y]-ay 0 + bL[y] = L[/(x)]; 


and solving for L[y] yields 


L[ j = £[/(*)]+(p + a)yo + yo 


( 5 ) 



The function/(x) is known, so its Laplace transform L[f(x)\ is a specific function 
of p ; and since a, b, y 0 , and y' 0 are known constants, L[y] is completely known 
as a function of p. If we can now find which function y(x) has the right side of 
equation (5) as its Laplace transform, then this function will be the solution 
of our problem—initial conditions and all. These procedures are particularly 
suited to solving equations of the form (1) in which the function/(x) is discon¬ 
tinuous, for in this case the methods of Chapter 3 may be difficult to apply. 

There is an obvious flaw in this discussion: in order for (2) to have any 
meaning, the functions/(x), y, y', and y" must have Laplace transforms. This 
difficulty can be remedied by simply assuming that /(x) is piecewise con¬ 
tinuous and of exponential order. Once this assumption is made, then it can 
be shown (we omit the proof) that y, y', and y" necessarily have the same 
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properties, so they also have Laplace transforms. Another difficulty is that 
in obtaining (3) and (4) we took it for granted that 

lim ye px = 0 and lim y'e~ px = 0. 


However, since y and if are automatically of exponential order, these state¬ 
ments are valid for all sufficiently large values of p. 


Example 1. Find the solution of 

y" + 4y = 4x (6) 

that satisfies the initial conditions y(0) = 1 and y'(0) = 5. 

When L is applied to both sides of (6), we get 

L[y"\ + 4 L[y] = 4L[x], (7) 

If we recall that L(x) = 1/p 2 , and use (4) and the initial conditions, then 
(7) becomes 


p 2 L[y] - p - 5 + 4L[y] 


so 


(p 2 + 4)L[y] = p + 5 + —, 


rr i P 5 4 


p-+ 4 p +4 p'(p + 4) 
p 5 11 

' _I_l_ 


H-5- 


/r + 4 p +4 p~ p +4 

p 4 1 

p 2 + 4 p 2 + 4 p 2 


( 8 ) 


On referring to the transforms obtained in Section 48, we see that (8) can 
be written 

L[y] = L[cos 2x] + L[2 sin 2x] + L[x] 

= Lfcos 2x + 2 sin 2x + x], 


y = cos 2x + 2 sin 2x + x 
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is the desired solution. We can easily check this result, for the general 
solution of (6) is seen by inspection to be 

y = Cj cos 2x + c 2 sin 2x + x, 

and the initial conditions imply at once that c 1 = 1 and c 2 = 2. 

The validity of this procedure clearly rests on the assumption that only 
one function y(x) has the right side of equation (8) as its Laplace trans¬ 
form. This is true if we restrict outselves to continuous y(x)'s —and any 
solution of a differential equation is necessarily continuous. When/(x) is 
assumed to be continuous, the equation L[f(x)] = F(p) is often written in 
the form 


L-'[F(P)] =/(x). 

It is customary to call D 1 the inverse Laplace transformation, and to refer 
to/(x) as the inverse Laplace transform of F(p). Since L is linear, it is evident 
that L _1 is also linear. In Example 1 we made use of the following inverse 
transforms: 


r 1 


v 

p 2 + 4 


= cos2x, 


r 1 


2 

p 2 +4 


= sin2x. 



= x. 


This example also illustrates the value of decomposition into partial 
fractions as a method of finding inverse transforms. 

For the convenience of the reader, we give a short list of useful trans¬ 
form pairs in Table 1. Much more extensive tables are available for the 
use of those who find it desirable to apply Laplace transforms frequently 
in their work. 

We shall consider a number of general properties of Laplace trans¬ 
forms that greatly increase the flexibility of Table 1. The first of these is 
the shifting formula: 


L[e“*f(x)] = F(p - a). (?) 

To establish this, it suffices to observe that 


L[e ax f(x)\ = jV p V x /(x)dx 
0 

00 

= je“ (p_ “)*/(x)dx 
0 

= F(p-a). 

Formula (9) can be used to find transforms of products of the form e ax f(x) 
when F(p) is known, and also to find inverse transforms of functions of 
the form F(p - a) when/(x) is known. 
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TABLE 1 

Simple Transform Pairs 


fix) F(p ) = Ll/Ml 


1 

1 


V 

X 

1 

P 2 

x n 

n\ 

7* 

e° x 

i 


p-a 

sin ax 

a 

p 2 + a 2 

cos ax 

V 

p 2 + a 2 

sinhflx 

a 

2 2 
p -a 

cosh ax 

V 

2 2 
p -a 


Example 2. 


so 


L[sin bx] 


b 


p + b 


L[e ax sinbx] 


b 

( p-a) 2 + b 2 


Example 3. 



so 


(; P~a) 


= e x. 


The methods of this section can be applied to systems of linear differ¬ 
ential equations with constant coefficients, and also to certain types of 
partial differential equations. Discussions of these further applications 
can be found in more extended works on Laplace transforms. 3 


3 For example, see R. V. Churchill, Operational Mathematics, 2d ed., McGraw-Hill, New York, 1958. 
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Problems 

1. Find the Laplace transforms of 

(a) x 5 e~ 2x -, 

(b) (l-x 2 )e~ x ; 

(c) e 3x cos lx. 

2. Find the inverse Laplace transforms of 


6 


(a) 


(p + 2f+9' 


1? 



.4 ' 


(0 . 

p 2 + 2p + 5 

3. Solve each of the following differential equations by the method of 
Laplace transforms: 

(a) if + y = 3e 2x , y(0) = 0; 

(b) y" - 4 y’ + 4y = 0, y(0) = 0 and y'(0) = 3; 

(c) y" + 2 y' + 2y = 2, y( 0) = 0 and y'(0) = 1; 

(d) y" + y' = 3x 2 , y(0) = 0 and y'(0) = 1; 

(e) y" + 2y' + 5y = 3e~ x sin x, y(0) = 0 and y'(0) = 3. 

4. Find the solution of y" - 2ay' + a 2 y - 0 in which the initial conditions 
y(0) = y 0 and y'(0) = y' 0 are left unrestricted. (This provides an additional 
derivation of our earlier solution, in Section 17, for the case in which the 
auxiliary equation has a double root.) 

5. Apply (3) to establish the formula for the Laplace transform of an 
integral. 


X 



and verify this by finding 


P(P + 1) 


6. Solve y' + 4y + 5 f ydx = e x , y(0) = 0. 


in two ways. 


o 
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51 Derivatives and Integrals of Laplace Transforms 

Consider the general Laplace transform formula 

oo 

F(p) = je- px f(x)dx. 
o 

The differentiation of this with respect to p under the integral sign can be 
justified, and yields 

oo 

F(p) = je-*(-x)f(x)dx (1) 

0 


or 

L[-xf(x)] = F'(p). (2) 

By differentiating (1), we find that 

L[xJ(x)]=F”(p), (3) 

and, more generally, that 

L[(-l)vf(x)] = F(">(p) (4) 

for any positive integer n. These formulas can be used to find transforms of 
functions of the form x'f(x) when F(p) is known. 


Example 1. Since L[sin ax\ = a/(p 2 + a 2 ), we have 


L[xsinax] 


d ( a 'j _ lap 
dpp 2 + a 2 J ( p 2 +a 2 ) 2 


Example 2. We know from Section 49 that L[x 1/2 ] = Jn/p, so 


L[x m ] = L[x(x~ V2 )] 


d_ 

dp 


( i—A 

71 





If we apply (2) to a function y(x ) and its derivatives—and remember for¬ 
mulas 50-(3) and 50-(4)—then we get 
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Uxtfl = --f %'] = ~\py ~ y(0)] = ~ \pY], (6) 

dp dp dp 


and 


L[x V "] = ~L[i/] = --f [p 2 Y-py(0)-y'(0)] 
dp dp 


d 


dp 


[p 2 Y - py(0)]. 


( 7 ) 


These formulas can sometimes be used to solve linear differential equa¬ 
tions whose coefficients are first degree polynomials in the independent 
variable. 


Example 3. Bessel's equation of order zero is 

xy" + y' +xy = 0. (8) 

It is known to have a single solution y(x) with the property that y(0) = 1. 
To find this solution, we apply L to (8) and use (5) and (7), which gives 

——[p 2 Y -p] + pY - 1 - ^ = 0 
dp V n f dp 


or 


(p 2 +V d ^=-v y 

dp 


( 9 ) 


If we separate the variables in (9) and integrate, we get 


y = -r=— = c(p 2 +l )~ 1/2 
VP + 1 


f y-i/2 

1 + - 

v p . 


( 10 ) 


On expanding the last factor by the binomial series 


(1 +zr=1+az+ «^ + «M z 3 + . 

2! 3! 

| a(a-l)--ia-n + l ) | 
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(10) becomes 


Y = - 


1 J_ + J_ 1 3 J__J_ 13 5 J_ + 

2 p 2 2! 2 2 p 4 3! 2 2 2 p 6 


l-3--5--(2n-l) (-1)" 
2 "h! p 2 ” 


-■Z 


(2«)! (-1)" 
-^2 2 "(h!) 2 p 2,I+1 ' 


If we now proceed formally, and compute the inverse transform of this 
series term by term, then we find that 


y(x) = 

n= 0 


(- 1 )” y*. 

2 2 "( h !) 2 


' x 2 x 4 x 6 

1 Y + 2T¥ 2 2 • 4 2 ■ 6 2 + ’’ 


Since y(0) = 1, it follows that c = 1, and our solution is 
, , 1 x 2 x 4 x 6 

y(x) = 1 -^ + ^2;^2 ~ 2 2 . 4 2 . 6 2 

This series defines the important Bessel function / 0 (x), whose Laplace 
transform we have found to be l/+1. We obtained this series in 
Chapter 8 in a totally different way, and it is interesting to see how easily 
it can be derived by Laplace transform methods. 


We now turn to the problem of integrating transforms, and our main result is 


f(x) 


oo 

:jF(p)dp. 


( 11 ) 


To establish this, we put L[f(x)/x\ = G(p). An application of (2) yields 


dG 


= L 


(-x) 


fix) 


= -L[/(x)] = -F(p), 


G(p) = -jf(p)dp 
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for some a. Since we want to make G(p) ->• 0 as p ^ we put a = °° and get 

oo 

G(p) = J* F(p)dp, 

V 

which is (11). This formula is useful in finding transforms of functions of the 
form f(x)/x when F(p) is known. Furthermore, if we write (11) as 



o 


jF(p)dp 

V 


and let p -» 0, we obtain 

00 00 

-dx = jp(p)dp, 
o o 


( 12 ) 


which is valid whenever the integral on the left exists. This formula can some¬ 
times be used to evaluate integrals that are difficult to handle by other methods. 


Example 4. Since L[sin x] = l/(p 2 + 1), (12) gives 


1 


sinx 

x 


dx = 


CO 



tan 1 p 


o 


71 

2 ' 


For easy reference, we list the main general properties of Laplace trans¬ 
forms in Table 2. It will be noted that the last item in the list is new. We 
shall discuss this formula and its applications in the next section. 


TABLE 2 


General Properties of L[f(x)] = F(p) 
L[af(x) + Pg(x)] = aF(p) + pG(p) 

L[e“/(x)] = F(p - a) 

L\f' (x)] = pF(p) -/(0); 

Hf'(x)] = p 2 F{p) - pf(0) -f (0) 



L[-xf(x)l = F’(p); 
L[(-iyx’f(x)] = FM(p) 

t [ i v]-J 


f f(x-t)g(t)dt 

Jo 


= F(p)G(p) 
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Problems 


1. Show that 


L[x cos ax] = —», 

(r+«) 


and use this result to find 


1 



2. Find each of the following transforms: 

(a) L[x 2 sin ax]-, 

(b) L[x 3/2 ]. 

3. Solve each of the following differential equations: 

(a) xy" + (3x - 1 )y' - (4x + 9 )y = 0, y( 0) = 0; 

(b) xy" + (2x + 3 )y' + (x + 3 )y = 3e~ x , y( 0) = 0. 

4. If y(x) satisfies the differential equation 


y" + xhy = 0, 


where y(0) = y 0 and y'( 0) = y' 0 , show that its transform Y(p) satisfies the 
equation 


Y” + p 2 Y = py 0 + y' 0 . 


Observe that the second equation is of the same type as the first, so that 
no progress has been made. The method of Example 3 is advantageous 
only when the coefficients are first degree polynomials. 

5. If a and b are positive constants, evaluate the following integrals: 




6. Show formally that 



n Jo 










468 


Differential Equations with Applications and Historical Notes 


7. If x > 0, show formally that 

(a) /W=r^ = f; 

Jot Z 

r, x rcosxt 71 

(b) /w= -rdt^-e*. 

Jo f + r 2 

8. (a) If fix) is periodic with period a, so that f(x + a) = fix), show that 

a 

E{ V )= ^ e ^Y VXfiX)dX - 

o 

(b) Find F(p) if f(x) =1 in the intervals from 0 to 1, 2 to 3,4 to 5, etc., and 
fix) = 0 in the remaining intervals. 


52 Convolutions and Abel's Mechanical Problem 

If L[f(x)] = F(p) and L[g(x)] = G(p), what is the inverse transform of F(p)G(p)7 
To answer this question formally, we use dummy variables s and t in the 
integrals defining the transforms and write 


F(p)G(p) = 

00 

J e~ ps f(s)ds 

00 

\e- pt g(t)dt 


_ 0 

_ 0 


= ^e- p{s+t) f(s)g(t)dsdt 
0 0 


00 00 

-//■ 


a -p( s +h 


/(s) ds 


g(t)dt, 


where the integration is extended over the first quadrant (s > 0, t > 0) in the 
sf-plane. We now introduce a new variable x in the inner integral of the last 
expression by putting s + t = x, so that s = x-t and ( t being fixed during this 
integration) ds = dx. This enables us to write 


F(p)G(p) = 


oo oo 

11 ' 


e px f(x-t)dx 


g(t)dt 


= jje- r ’ x f(x-t)g(t)dxdt. 
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FIGURE 66 


This integration is extended over the first half of the first quadrant (x -1 > 0) in 
the xt- plane, and reversing the order as suggested in Figure 66, we get 


F(p)G(p) = 


oo X 

11 ' 


e px f(x-t)g(t)dt 


dx 


00 

\ £VX 

1 

Jbo 

1 

I-!* 

0 

_0 


dx 


= L 


j f(x~t)g(t)dt 
.0 


( 1 ) 


The integral in the last expression is a function of the upper limit x, and pro¬ 
vides the answer to our question. This integral is called the convolution of the 
functions/(x) and g(x). It can be regarded as a "generalized product" of these 
functions. The fact stated in equation (1)—namely, that the product of the 
Laplace transforms of two functions is the transform of their convolution—is 
called the convolution theorem. 

The convolution theorem can be used to find inverse transforms. For 
instance, since L[x] = 1/p 2 and L[sin x] = l/(p 2 + 1), we have 
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1 

r 1 

l 

f l ^1 

LpV+1). 


[p 2 

Ip 2 +i J_ 


: J(x-f) 


-1) sin tdt 


= x - sm x, 


as can easily be verified by partial fractions. A more interesting class of appli¬ 
cations arises as follows. I ff(x) and k(x) are given functions, then the equation 

f(x) = y(x) + jk(x-t)y(t)dt, (2) 

o 

in which the unknown function y(x) appears under the integral sign, is called 
an integral equation. Because of its special form, in which the integral is the 
convolution of the two functions k(x) and y(x), this equation lends itself to 
solution by means of Laplace transforms. In fact, if we apply L to both sides 
of equation (2), we get 

L[f(x)] = L[ij(x)] + L[k(x)]L[ij(x)\, 


so 


L[y(x)] = 


JWx)] 

1 + L[k(x)] 


( 3 ) 


The right side of (3) is presumably known as a function of p; and if this func¬ 
tion is a recognizable transform, then we have our solution y(x). 


Example 1 . The integral equation 

X 

,w-e + J S m(«-() v m 

o 

is of this type, and by applying L we get 

L[y(.\')] = L[x 3 ] + L[sin x]L[y(x)]. 
Solving for L[i/(x)] yields 

L[x 3 ] _ 3 !/p 4 


L[t/(x)] = 


1 - L[sin x\ l-l/(p +1) 

Ol 


p Z +l 


_ 3! 3! 

P 4 + P 6 ‘ 


(4) 
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so 



is the solution of (4). 

As a further illustration of this technique, we analyze a classical problem 
in mechanics that leads to an integral equation of the above type. Consider a 
wire bent into a smooth curve (Figure 67) and let a bead of mass m start from 
rest and slide without friction down the wire to the origin under the action 
of its own weight. Suppose that ( x , y) is the starting point and (u, v) is any 
intermediate point. If the shape of the wire is specified by a given function 
y = y(x), then the total time of descent will be a definite function T(y) of the 
initial height y. Abel's mechanical problem is the converse: specify the function 
T(y) in advance and then find the shape of the wire that yields this T(y) as the 
total time of descent. 

To formulate this problem mathematically, we start with the principle of 
conservation of energy: 



which can be written as 


^2g(y-v )' 


y 



X 


FIGURE 67 
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On integrating this from v = y to v - 0, we get 


y=0 v=y 

T(y)= jdt= J 

v=y v=Q 


ds 

yl2g(y-v) 


1 f s'(v) dv 

Tpglyly 1 *' 


( 5 ) 


Now 


y 

s = s(y) = J 


= P + 

0 


f dx ^ 2 


dy 


dy 


is known whenever the curve y = y(x) is known, so its derivative 


f(y) = s'(y) = J i+ 


f dx' 2 
\ dx Jj 


( 6 ) 


is also known. If we insert (6) in (5), then we see that 


ny) = 


1 j* f{v) dv 


( 7 ) 


and this enables us to calculate T(y) whenever the curve is given. In Abel's 
problem we want to find the curve when T(y) is given; and from this point of 
view, the function/(y) in equation (7) is the unknown and (7) itself is called 
Abel’s integral equation. Note that the integral in (7) is the convolution of the 
functions y 1 /2 and/(y), so on applying the Laplace transformation L we get 


L[T(y)] = 




L[y- 1/2 ]L[/(y)]. 


If we now recall that L[y 1/2 ] = yjn/p, then this yields 

um)=& I W 


- WL 


p 1/2 L[T(y)J. 


( 8 ) 


When T(y) is given, the right side of equation (8) is known as a function of 
p, so hopefully we can find/(y) by taking the inverse transform. Once/(y) is 
known, the curve itself can be found by solving the differential equation (6). 
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As a concrete example, we now specialize our discussion to the case in 
which T(y) is a constant T 0 . This assumption means that the time of descent 
is to be independent of the starting point. The curve defined by this property 
is called the tautochrone, so our problem is that of finding the tautochrone. In 
this case, (8) becomes 


L[/m= 




where b = 2 gTf / n 1 . The inverse transform of -yjn/p is y~ 1/2 , so 


f(v) = 



( 9 ) 


With this/(y), (6) now yields 


1 + 


dx 

\ d Vj 


b 

V 


as the differential equation of the curve, so 


x = 




On substituting y = b sin 2 cf>, this becomes 

x = 2b j* cos 2 ()) d§ = bj*(l + cos 2(j>) d<\> 


= — (2(j) + sin 2(j)) + c. 


so 

x = ^ ( 24 » + sin 24 ») + c and y = ^( l-cos2())). (10) 

The curve must pass through the origin (0,0), so c - 0; and if we put a = b/2 
and 0 = 24 >, then (10) take the simpler form 

x-a(Q + sin 0) and y = a(l - cos 0). 
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\ 


X 


FIGURE 68 


These are the parametric equations of the cycloid shown in Figure 68, which 
is generated by a fixed point on a circle of radius a rolling under the horizon¬ 
tal dashed line y = 2a. Since 2a=b = IgTo /n * 1 2 , the diameter of the generating 
circle is determined by the constant time of descent. 

Accordingly, the tautochrone is a cycloid. In Problems 6-5 and 11-5 we 
verified this property of cycloids by other methods. Our present discussion 
has the advantage of enabling us to find the tautochrone without knowing in 
advance what the answer will be. 


Problems 

1. Find lOfl/lp 2 + a 2 ) 2 ] by convolution. (See Problem 51-1.) 

2. Solve each of the following integral equations: 


(a) y(x) = l-[ ( x-t)ij(t)dt ; 

Jo 



3. Deduce 



from equation (8), and use this to verify (9) when T(y) is a constant T 0 . 

4. Find the equation of the curve of descent if T(y) = kjy for some con¬ 
stant k. 
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5. Show that the differential equation 

y"+ a2 y = fix), y( o) - y'( o) = o, 

has 

X 

y(x) = — f/(f) sin a(x -t)dt 

a J 

o 


as its solution. 


53 More about Convolutions. The Unit 
Step and Impulse Functions 

In the preceding section we found that the product of the Laplace transforms 
of two functions is the transform of a certain combination of these functions 
called their convolution. If we use the time f as the independent variable and 
if the two functions are/(f) and g(t), then this convolution theorem [equation 
52-(l)[ can be expressed as follows: 


Lumm )=l 


t 

^f(t-x)g(x)dT 


(1) 


It is customary to denote the convolution of/(f) and g(t) by f(t)*g(t), so that 


f(t)*g(t) = ^f(t~x)g(T)dT. (2) 

0 


The convolution theorem (1) can then be written in the form 

mrm = ufiwmi o) 

Our purpose in this section is to discuss an application of this theorem that 
makes it possible to determine the response of a mechanical or electrical sys¬ 
tem to a general stimulus if its response to the unit step function is known. 
These ideas have important uses in electrical engineering and other areas of 
applied science. 

Any physical system capable of responding to a stimulus can be thought 
of as a device that transforms an input function (the stimulus) into an output 






476 


Differential Equations with Applications and Historical Notes 


function (the response). If we assume that all initial conditions are zero at the 
moment t = 0 when the input /(f) begins to act, then by setting up the dif¬ 
ferential equation that describes the system, operating on this equation with 
the Laplace transformation L, and solving for the transform of the output y(t), 
we obtain an equation of the form 


W)] = 


mm 

z{p) 


( 4 ) 


where z(p) is a polynomial whose coefficients depend only on the parameters 
of the system itself. This equation is the main source of the explicit formulas 
for y(t) that we obtain below with the aid of the convolution theorem. 

Let us be more specific. We seek solutions y(t) of the linear differential 
equation 


y" + ay' + by =/(f) (5) 

that satisfy the initial conditions 

y(0) = y'(0) = 0 (6) 

describing a mechanical or electrical system at rest in its equilibrium posi¬ 
tion. The input/(f) can be thought of as an impressed external force F or elec¬ 
tromotive force £ that begins to act at time f = 0, as discussed in Section 20. 
When this input is the unit step function u(t) defined in Problem 49-2(a), 
the solution (or output) y(t) is denoted by A(t) and called the indicial response-, 
that is. 


A” + aA' +bA = u(t). 

By applying the Laplace transformation L and using formulas (3) and (4) in 
Section 50, we obtain 


p 2 L[A] + apL[A] + bL[A] = L[u(t)] = 


so 


L[A] = 


1 1 

p p 2 + ap + b 


1 1 

V z(p)' 


( 7 ) 


where z(p) is defined by the last equality. We now apply L in the same way to 
the general equation (5), which yields (4); and dividing both sides of this by 
p and using (7) gives 
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-L[y] = ^—L[f] = L[A\L[fl 
V P z (p) 


The convolution theorem now enables us to write (8) in the form 



( 8 ) 


By using formula 50-(3) once more we get 


L[y]=pL 


= L 


t 

j*A(f - t)/(t) dx 
o 

t 

i lt jA(t-x)f(x)dx 


so 


(9) 

0 


By applying Leibniz's rule for differentiating integrals 4 to (9), we now have 


3/(0 = jA\t - t)/(t) dx + 

o 


Next, since L[A]L[f] - L[f]L[A], (8) also enables us to write 


1 

P 


L[y] = L[f(t)*A(t)] = L 


t 

1 


f(t-a)A(a) da 


( 10 ) 


4 Leibniz's rule states that if F(f) = \° u G{t,x)dx, where u and v are functions of t and x is a dummy 
variable, then 

V 

—F(t)= (—G(t,x)dx + G(t, v) — - G(t, u) —. 
dt Jdt dt dt 


See p. 613 of George F. Simmons, Calculus With Analytic Geometry, McGraw-Hill, New York, 
1985. 
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and by following the same reasoning as before, we obtain 

f 

J/(0 = J/'(f - ctM(ct) da + f{0)A{t). (11) 

o 

In formula (10) we notice that 71(0) = 0 because of the initial conditions (6); 
and (11) takes a more convenient form under the change of variable t = t - a. 
Our two formulas (10) and (11) for y(t) therefore become 

t 

y(f) = j>(f-T)/(T)dx (12) 

0 


and 


y(t) = jA(f-T)/'(T)dT + /(0)A(f). (13) 

0 

Each of these formulas provides a solution of (5) for a general input /(f) in 
terms of the indicial response A(t) to the unit step function. Formula (13) is 
sometimes called the principle of superposition; it has been variously attributed 
to the famous nineteenth century physicists James Clerk Maxwell and Ludwig 
Boltzmann, and also to the English applied mathematician Oliver Heaviside. 

Example 1. Use formula (13) to solve y" +y' - 6y = 2e 3f , where y(0) = y'(0) = 0. 

Here we have 


L[A(t)] = - 


P(P 2 +P~ 6)' 
so by partial fractions and inversion we find that 


A(t) = -— + —e~ 3t + —e 2f . 
6 15 10 


Sinee/(f) = 2e 3t ,f'(t) = 6e 3t and/(0) = 2, (13) gives 


»«-} 


o 

+ 2 


- 1 - e 

6 15 


-3(t-i) _ 




10 


6e 3z dx 


1 J_ 

6 + 15 


—e 2 ' 

10 


1 3i 1 _3f 2 2 f 

-e +—e --e . 
3 15 5 






Laplace Transforms 


479 


This solution can be verified by substituting directly in the given equa¬ 
tion, and also by solving the equation by the method already studied in 
Section 50. 

We can also use formula (12) to solve the equation in this example, but before 
doing this, it is desirable to express (12) in a simpler form. We accomplish this 
by using the unit impulse function 5(f) described in Problem 49-5. In physics, 
the impulse due to a constant force F acting over a time interval Af is defined to 
be F Af. The "function" 5(f) can be thought of as a limit of constant functions 
of unit impulse acting over shorter and shorter intervals of time; it is used to 
describe forces and voltages that act very suddenly, as in the case of a ham¬ 
mer blow on a mechanical system or a lightning stroke on a transmission line. 

For us, the essential property of 6(f) is that expressed by the equation 


L[8(0] = 1, 


obtained in Problem 49-5. When the input/(f) in the differential equation (5) 
is the unit impulse function 5(f), the output y(t) is denoted by h(t) and called 
the impulsive response. Applying L in this case yields 



(14) 


so 



By (7) and (14), 


L[A(f)] = -— = 


p z(p) p 


and it follows from Problem 50-5 that 



o 


This shows that A'(t) = h(t), so formula (12) becomes 


t 


y(f ) = jh(t - x)/ (t) di. 


(15) 


o 


Thus, the solution of (5) with a general input/(f) can be written as the convo¬ 
lution of the impulsive response h(t) with/(f). 
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Example 2. Consider again the equation y" + y' - 6y = 2e 31 solved in 
Example 1. We have 


h(t) = L~ 


(p + 3)(p-2) 


= — (e 2f -e~ 3 ‘), 
5 


so that 


y(t) = J |[e 2( '“ T) -e- 3{, - x) ]2e 3r dz 
o 


1 3; 1 _3; 2 21 

-e +—e --e , 
3 15 5 


as before. 

Remark 1. In complicated practical situations electrical engineers are some¬ 
times compelled to work with indicial or impulsive responses A(t) or h(t) 
that are only accessible experimentally, by means of oscilloscope pictures 
responding to generator-produced step functions or impulse functions. 
In such a case the output must be calculated from (13) or (15) by methods 
of graphical integration that permit the plotting of individual points on the 
output curve. For a discussion of these topics see Chapter 9 of W. D. Day, 
Introduction to Laplace Transformsfor Radio and Electronic Engineers, Interscience, 
New York, 1960. 

Remark 2. To form a more general view of the meaning of convolution let us 
consider a linear physical system in which the effect at the present time t of 
a small stimulus g(x) dx at any past time x is proportional to the size of the 
stimulus. We further assume that the proportionality factor depends only on 
the elapsed time t-x, and thus has the form f(t - t) The effect at the present 
time t is therefore 


f(t - r)g(r) dx. 

Since the system is linear, the total effect at the present time t due to the 
stimulus acting throughout the entire past history of the system is obtained 
by adding these separate effects, and this leads to the convolution integral 

t 

0 

The lower limit here is 0 because we assume that the stimulus started acting 
at time t = 0, that is, that g(x) = 0 for x < 0. The importance of convolution is 
difficult to exaggerate: it provides a reasonable way of taking account of the 
past in the study of wave motion, heat conduction, diffusion, and other areas 
of mathematical physics. 
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Problems 

1. Show that f(t) * g(t) - g(t) *f(t) directly from the definition (2), by intro¬ 
ducing a new dummy variable a - t - x. This shows that the opera¬ 
tion of forming convolutions is commutative. It is also associative and 
distributive: 


and 

[f(t) +g (t)]*h(t)=f(t)*h(t)+g(t)*h(t). 

An interesting discussion of the abstract properties of convolution is 
given by Mark Kac and Stanislaw Ulam on pp. 140-142 of Mathematics 
and Logic, New American Library, New York, 1969. 

2. Find the convolution of each of the following pairs of functions: 

(a) 1, sin at; 

(b) e at , e bt , where a * b; 

(c) t,e at ; 

(d) sin at, sin bt, where a*b. 

3. Verify the convolution theorem for each of the pairs of functions con¬ 
sidered in Problem 2. 

4. Use the methods of both Examples 1 and 2 to solve each of the follow¬ 
ing differential equations: 

(a) y" + 5y' + 6y = 5e 3t , y(0) = y'(0) = 0; 

(b) y" + y'-6y = t, y( 0) = y\ 0) = 0; 

(c) y" -y' = t 2 , y(0) = y'(0) = 0. 

5. When the polynomial z(p) has distinct real zeros a and b, so that 

1 1 A | B 

z(p) (P~ a )(p-b) P~ a P~b 

for suitable constants A and B, then 

h(t) = Ae ai + Be bt 

and (15) takes the form 

t 

y (t) = J/(x)[Ae‘ !( '- T) + Be Ht ^]dT. 

0 
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This is sometimes called the Heaviside expansion theorem. 

(a) Use this theorem to write the solution of y" + 3 y' + 2 y - f(t), 
y(0)=y'(0)=o. 

(b) Give an explicit evaluation of the solution in (a) for the cases/(f) = e 3t 
and/(f) = t. 

(c) Find the solutions in (b) by using the superposition principle (13). 

6. Formula (13) can also be derived from (4) as follows, without the use of 
Leibniz's rule for differentiating integrals: 


L[y(t)]= l ^ ( ^ = ^ ^ ■pL[f(t)] 
z(p) pz(p) 

= L[A(t)]-pL[f(t )] 

= L[A(t)]-{L[f'(t)] + fm 

= L[A(t)*f’(t)] + f(0)L[A(t)] 


= L 


t 

j*A(f - t)/'(t) dx + f(0)A(t) 


Check the steps. 

7. As we know from Section 20, the forced vibrations of an undamped 
spring-mass system are described by the differential equation 

Mi" + kx =/(f), 

where x(t) is the displacement and/(f) is the impressed external force or 
"forcing function." If x(0) = x'(0) = 0, find the functions A(t) and h(t) and 
write down the solution x(t) for any/(f). 

8. The current l(t) in an electric circuit with inductance L and resistance R 
is given by equation (4) in Section 13: 

L — + RI = E(t), 
dt V ' 

where E(t) is the impressed electromotive force. If 1(0) - 0, use the meth¬ 
ods of this section to find f(f) in each of the following cases: 

(a) E(t) = E 0 ii(t)- r 

(b) E(f) = E 0 5(f); 

(c) E(f) = E 0 sin cof. 
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Appendix A. Laplace 

Pierre Simon de Laplace (1749-1827) was a French mathematician and theo¬ 
retical astronomer who was so famous in his own time that he was known 
as the Newton of France. His main interests throughout his life were celestial 
mechanics, the theory of probability, and personal advancement. 

At the age of twenty-four he was already deeply engaged in the detailed 
application of Newton's law of gravitation to the solar system as a whole, in 
which the planets and their satellites are not governed by the sun alone but 
interact with one another in a bewildering variety of ways. Even Newton 
had been of the opinion that divine intervention would occasionally be 
needed to prevent this complex mechanism from degenerating into chaos. 
Laplace decided to seek reassurance elsewhere, and succeeded in proving 
that the ideal solar system of mathematics is a stable dynamical system 
that will endure unchanged for all time. This achievement was only one of 
the long series of triumphs recorded in his monumental treatise Mecanique 
Celeste (published in five volumes from 1799 to 1825), which summed up the 
work on gravitation of several generations of illustrious mathematicians. 
Unfortunately for his later reputation, he omitted all reference to the dis¬ 
coveries of his predecessors and contemporaries, and left it to be inferred 
that the ideas were entirely his own. Many anecdotes are associated with 
this work. One of the best known describes the occasion on which Napoleon 
tried to get a rise out of Laplace by protesting that he had written a huge 
book on the system of the world without once mentioning God as the author 
of the universe. Laplace is supposed to have replied, "Sire, I had no need of 
that hypothesis." The principal legacy of the Mecanique Celeste to later gen¬ 
erations lay in Laplace's wholesale development of potential theory, with its 
far-reaching implications for a dozen different branches of physical science 
ranging from gravitation and fluid mechanics to electro-magnetism and 
atomic physics. Even though he lifted the idea of the potential from Lagrange 
without acknowledgment, he exploited it so extensively that ever since his 
time the fundamental differential equation of potential theory has been 
known as Laplace's equation. 

His other masterpiece was the treatise Theorie Analytique des Probabilities 
(1812), in which he incorporated his own discoveries in probability from 
the preceding 40 years. Again he failed to acknowledge the many ideas 
of others he mixed in with his own; but even discounting this, his book is 
generally agreed to be the greatest contribution to this part of mathematics 
by any one man. In the introduction he says: "At bottom, the theory of 
probability is only common sense reduced to calculation." This may be 
so, but the following 700 pages of intricate analysis—in which he freely 
used Laplace transforms, generating functions, and many other highly 
nontrivial tools—has been said by some to surpass in complexity even the 
Mecanique Celeste. 
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After the French Revolution Laplace's political talents and greed for posi¬ 
tion came to full flower. His countrymen speak ironically of his "supple¬ 
ness" and "versatility" as a politician. What this really means is that each 
time there was a change of regime (and there were many), Laplace smoothly 
adapted himself by changing his principles—back and forth between fer¬ 
vent republicanism and fawning royalism—and each time he emerged with 
a better job and grander titles. He has been aptly compared with the apocry¬ 
phal Vicar of Bray in English literature, who was twice a Catholic and twice a 
Protestant. The Vicar is said to have replied as follows to the charge of being 
a turncoat: "Not so, neither, for if I changed my religion, I am sure I kept true 
to my principle, which is to live and die the Vicar of Bray." 

To balance his faults, Laplace was always generous in giving assistance 
and encouragement to younger scientists. From time to time he helped for¬ 
ward in their careers such men as the chemist Gay-Lussac, the traveler and 
naturalist Humboldt, the physicist Poisson, and—appropriately—the young 
Cauchy, who was destined to become one of the chief architects of nineteenth 
century mathematics. 


Appendix B. Abel 

Niels Henrik Abel (1802-1829) was one of the foremost mathematicians of 
the nineteenth century and probably the greatest genius produced by the 
Scandinavian countries. Along with his contemporaries Gauss and Cauchy, 
Abel was one of the pioneers in the development of modern mathematics, 
which is characterized by its insistence on rigorous proof. His career was a 
poignant blend of good-humored optimism under the strains of poverty and 
neglect, modest satisfaction in the many towering achievements of his brief 
maturity, and patient resignation in the face of an early death. 

Abel was one of six children in the family of a poor Norwegian country 
minister. His great abilities were recognized and encouraged by one of his 
teachers when he was only sixteen, and soon he was reading and digesting 
the works of Newton, Euler, and Lagrange. As a comment on this experience, 
he inserted the following marginal remark in one of his later mathematical 
notebooks: "It appears to me that if one wants to make progress in math¬ 
ematics, one should study the masters and not the pupils." When Abel was 
only eighteen his father died and left the family destitute. They subsisted by 
the aid of friends and neighbors, and somehow the boy, helped by contribu¬ 
tions from several professors, managed to enter the University of Oslo in 
1821. His earliest researches were published in 1823, and included his solu¬ 
tion of the classic tautochrone problem by means of the integral equation 
discussed in Section 52. This was the first solution of an equation of this 
kind, and foreshadowed the extensive development of integral equations in 
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the late nineteenth and early twentieth centuries. He also proved that the 
general fifth degree equation ax 5 + bx i + cx 3 + dx 2 + ex +/= 0 cannot be solved 
in terms of radicals, as is possible for equations of lower degree, and thus 
disposed of a problem that had baffled mathematicians for 300 years. He 
published his proof in a small pamphlet at his own expense. 

In his scientific development Abel soon outgrew Norway, and longed to 
visit France and Germany. With the backing of his friends and professors 
he applied to the government, and after the usual red tape and delays, he 
received a fellowship for a mathematical grand tour of the Continent. He 
spent most of his first year abroad in Berlin. Here he had the great good 
fortune to make the acquaintance of August Leopold Crelle, an enthusias¬ 
tic mathematical amateur who became his close friend, advisor, and protec¬ 
tor. In turn, Abel inspired Crelle to launch his famous journal fur die Reine 
und Angezvandte Mathematik, which was the world's first periodical devoted 
wholly to mathematical research. The first three volumes contained 22 con¬ 
tributions by Abel. 

Abel's early mathematical training had been exclusively in the older for¬ 
mal tradition of the eighteenth century, as typified by Euler. In Berlin he 
came under the influence of the new school of thought led by Gauss and 
Cauchy, which emphasized rigorous deduction as opposed to formal cal¬ 
culation. Except for Gauss's great work on the hypergeometric series, there 
were hardly any proofs in analysis that would be accepted as valid today. As 
Abel expressed it in a letter to a friend: "If you disregard the very simplest 
cases, there is in all of mathematics not a single infinite series whose sum 
has been rigorously determined. In other words, the most important parts of 
mathematics stand without a foundation." In this period he wrote his clas¬ 
sic study of the binomial series, in which he founded the general theory of 
convergence and gave the first satisfactory proof of the validity of this series 
expansion. 

Abel had sent to Gauss in Gottingen his pamphlet on the fifth degree equa¬ 
tion, hoping that it would serve as a kind of scientific passport. However, for 
some reason Gauss put it aside without looking at it, for it was found uncut 
among his papers after his death 30 years later. Unfortunately for both men, 
Abel felt that he had been snubbed, and decided to go on to Paris without 
visiting Gauss. 

In Paris he met Cauchy, Legendre, Dirichlet, and others, but these meet¬ 
ings were perfunctory and he was not recognized for what he was. He had 
already published a number of important articles in Crelle's Journal, but the 
French were hardly aware yet of the existence of this new periodical and 
Abel was much too shy to speak of his own work to people he scarcely knew. 
Soon after his arrival he finished his great Memoire sur line Propriety Generale 
d'une Classe Tres Etendue des Fonctions Transcendantes, which he regarded as 
his masterpiece. This work contains the discovery about integrals of alge¬ 
braic functions now known as Abel's theorem, and is the foundation for the 
later theory of Abelian integrals. Abelian functions, and much of algebraic 
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geometry. Decades later, Hermite is said to have remarked of this Memoire: 
"Abel has left mathematicians enough to keep them busy for 500 years." Jacobi 
described Abel's theorem as the greatest discovery in integral calculus of the 
nineteenth century. Abel submitted his manuscript to the French Academy. 
He hoped that it would bring him to the notice of the French mathemati¬ 
cians, but he waited in vain until his purse was empty and he was forced 
to return to Berlin. What happened was this: the manuscript was given to 
Cauchy and Legendre for examination; Cauchy took it home, mislaid it, and 
forgot all about it; and it was not published until 1841, when again the manu¬ 
script was lost before the proof sheets were read. The original finally turned 
up in Florence in 1952. 5 In Berlin, Abel finished his first revolutionary article 
on elliptic functions, a subject he had been working on for several years, and 
then went back to Norway, deeply in debt. 

He had expected on his return to be appointed to a professorship at the 
university, but once again his hopes were dashed. He lived by tutoring, and 
for a brief time held a substitute teaching positon. During this period he 
worked incessantly, mainly on the theory of the elliptic functions that he 
had discovered as the inverses of elliptic integrals. This theory quickly took 
its place as one of the major fields of nineteenth century analysis, with many 
applications to number theory, mathematical physics, and algebraic geom¬ 
etry. Meanwhile, Abel's fame had spread to all the mathematical centers of 
Europe and he stood among the elite of the world's mathematicians, but in 
his isolation he was unaware of it. By early 1829 the tuberculosis he con¬ 
tracted on his journey had progressed to the point where he was unable to 
work, and in the spring of that year he died, at the age of twenty-six. As an 
ironic postcript, shortly after his death Crelle wrote that his efforts had been 
successful, and that Abel would be appointed to the chair of mathematics in 
Berlin. 

Crelle eulogized Abel in his Journal as follows: "All of Abel's works carry 
the imprint of an ingenuity and force of thought which is amazing. One may 
say that he was able to penetrate all obstacles down to the very foundation 
of the problem, with a force which appeared irresistible... He distinguished 
himself equally by the purity and nobility of his character and by a rare 
modesty which made his person cherished to the same unusual degree as 
was his genius." Mathematicians, however, have their own ways of remem¬ 
bering their great men, and so we speak of Abel's integral equation. Abelian 
integrals and functions. Abelian groups, Abel's series, Abel's partial summa¬ 
tion formula, Abel's limit theorem in the theory of power series, and Abel 
summability. Few have had their names linked to so many concepts and 
theorems in modern mathematics, and what he might have accomplished in 
a normal lifetime is beyond conjecture. 


5 For the details of this astonishing story, see the fine book by O. Ore, Niels Henrik Abel: 
Mathematician Extraordinary, University of Minnesota Press, Minneapolis, 1957. 
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54 General Remarks on Systems 

One of the fundamental concepts of analysis is that of a system of n simul¬ 
taneous first order differential equations. If i/,(x), y 2 (x),..., y n (x) are unknown 
functions of a single independent variable x, then the most general system 
of interest to us is one in which their derivatives y\,y'i, ... ,y’„ are explicitly 
given as functions of x and y v y 2/ ..., y„: 

y'i = f\{x,y l ,y 2 , ... ,y„) 
y'i = Hx,y x ,y 2 , ... , y„) 

( 1 ) 


y'n = fn{x,yx,y 2 , ■■■ ,y„). 


Systems of differential equations arise quite naturally in many scientific 
problems. In Section 22 we used a system of two second order linear equa¬ 
tions to describe the motion of coupled harmonic oscillators; in the example 
below we shall see how they occur in connection with dynamical systems 
having several degrees of freedom; and in Section 57 we will use them to 
analyze a simple biological community composed of different species of ani¬ 
mals interacting with one another. 

An important mathematical reason for studying systems is that the single 
nth order equation 


yM=/(*,y,yyM) 

can always be regarded as a special case of (1). To see this, we put 

yi = y, y 2 =y'/ y n =y (n ~ 1) 


( 2 ) 

(3) 
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and observe that (2) is equivalent to the system 

yi = y 2 

y2 = y 3 

y'n = f(x,y lr y 2 , ... ,y„\ 


which is clearly a special case of (1). The statement that (2) and (4) are equiva¬ 
lent is understood to mean the following: if y(x) is a solution of equation (2), 
then the functions y x (x), y 2 {x),. .., y„(x) defined by (3) satisfy (4); and conversely, 
if yfx), y 2 (x),. ■., y„(x) satisfy (4), then y(x) = yfx) is a solution of (2). 

This reduction of an nth order equation to a system of n first order equa¬ 
tions has several advantages. We illustrate by considering the relation 
between the basic existence and uniqueness theorems for the system (1) and 
for equation (2). 

If a fixed point x = x 0 is chosen and the values of the unknown functions 

yi(x 0 )=a 1 , y 2 (x 0 )= a 2 , ..., y„(x 0 ) = a n (5) 

are assigned arbitrarily in such a way that the functions/,,/^.. .,/„ are defined, 
then (1) gives the values of the derivatives y;(x 0 ),y 2 (x 0 ), , y»(*o) The simi¬ 

larity between this situation and that discussed in Section 2 suggests the 
following analog of Picard's theorem. 


Theorem A. Let the functions f,f 2/ . . .,/„ and the partial derivatives df/dy y ... ,df/ 
dy n , ... ,df n /dy lr ... ,df„/dy n be continuous in a region R of(x, y v y 2 ,..., y n ) space. If 
(x 0 , a v a 2 ,..., a n ) is an interior point ofR, then the system (1) has a unique solutioii 
yfx), y 2 (x),..., y„(x) that satisfies the initial coiiditions (5). 

We will not prove this theorem, but instead remark that when the ground 
has been properly prepared, its proof is identical with that of Picard's theo¬ 
rem as given in Chapter 13. Furthermore, by virtue of the above reduction. 
Theorem A includes as a special case the following corresponding theorem 
for equation (2). 


Theorem B. Let the function f and the partial derivatives df/dy, df/dy',..., 3//3y (,,_1) 
be continuous in a region R of(x, y, y',..., y (,!_1) ) space. If (x 0 , a v a 2/ ..., «„) is an interior 
point ofR, then equation (2) has a unique solution y(x) that satisfies the initial condi¬ 
tions y(x 0 ) - y'(x 0 ) = a 2r .., y {n ~ l \x 0 ) = a n . 

As a further illustration of the value of reducing higher order equations to 
systems of first order equations, we consider the famous n-body problem of 
classical mechanics. 
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FIGURE 69 


Let n particles with masses m l be located at points (x„ i/„ z,) and assume 
that they attract one another according to Newton's law of gravitation. If is 
the distance between m, and nq, and if 0 is the angle from the positive x-axis 
to the segment joining them (Figure 69), then the x component of the force 
exerted on m, by m is 


Gm I m l 


cos 0 = 


Gm,mj(Xj -x,) 


r ? 

r ‘i 


where G is the gravitational constant. Since the sum of these components 
for all j / i equals mfFxfdt 2 ), we have n second order differential equations 


d 2 x, 


-<=r 


mfXj-Xj) 




and similarly 


d 2 yi [ ; y /»•('/ '/.) 

dt 2 ^ r» 




and 


d 2 Zi 

dt 2 
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If we put v Xi = dxj/dt , v yi = dyt/dt , and v Zl = dzi/dt, and apply the above reduc¬ 
tion, then we obtain a system of 6 n equations of the form (1) in the unknown 
functions x 1 , v xi ,..., x n , v Xn , y lt v yi , ... , y n , v y „, z lr v zi ,..., z n , v Zn . If we now make 
use of the fact that 


r$ = [(*/ - xf 1 2 + (y, - yff + (z, - z,-) 2 ] 3/2 , 

then Theorem A yields the following conclusion: if the initial positions and 
initial velocities of the particles, i.e., the values of the unknown functions at a 
certain instant t = t 0 , are given, and if the particles do not collide in the sense 
that the r y do not vanish, then their subsequent positions and velocities are 
uniquely determined. This conclusion underlies the once popular philoso¬ 
phy of mechanistic determinism, according to which the universe is nothing 
more than a gigantic machine whose future is inexorably fixed by its state at 
any given moment. 1 


Problems 

1. Replace each of the following differential equations by an equivalent 
system of first order equations: 

(a) y" -x 2 y' -xy = 0; 

(b) y^if-xfyf. 

2. If a particle of mass m moves in the xy-plane, its equations of motion are 

m^y = /(h^y) and m^ = g(t,x,y), 

where / and g represent the x and y components, respectively, of the 
force acting on the particle. Replace this system of two second order 
equations by an equivalent system of four first order equations of the 
form (1). 


1 It also led Sir James Jeans to define the universe as "a self-solving system of 6N simultaneous 
differential equations, where N is Eddington's number." Sir Arthur Eddington asserted (with 
more poetry than truth) that 


N = — x 136 x 2 256 

2 

is the total number of particles of matter in the universe. See Jeans, The Astronomical Horizon, 
Oxford University Press, London, 1945; or Eddington, The Expanding Universe, Cambridge 
University Press, London, 1952. 
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55 Linear Systems 

For the sake of convenience and clarity, we restrict our attention through 
the rest of this chapter to systems of only two first order equations in two 
unknown functions, of the form 


§» Ht.x.y) 

~rr = G(t,x,y). 
I at 


(i) 


The brace notation is used to emphasize the fact that the equations are linked 
together, and the choice of the letter t for the independent variable and x and 
y for the dependent variables is customary in this case for reasons that will 
appear later. 

In this and the next section we specialize even further, to linear systems, of 
the form 


— = ai(t)x + b 1 (t)y + /i(f) 
at 

wf = a 2 (0 X + b 2 (t)y + f 2 (t). 
dt 


( 2 ) 


We shall assume in the present discussion, and in the theorems stated below, 
that the functions a^t), b,(t), and f(t), i = 1 , 2 , are continuous on a certain closed 
interval [a, b] of the f-axis. If/i(f) and/ 2 (f) are identically zero, then the system 
(2) is called homogeneous-, otherwise it is said to be nonhomogeneous. A solu¬ 
tion of (2) on [a, b] is of course a pair of functions x(t) and y(t) that satisfy both 
equations of (2) throughout this interval. We shall write such a solution in 
the form 


x = x(t) 

y = y(0- 


Thus, it is easy to verify that the homogeneous linear system (with constant 
coefficients) 



( 3 ) 
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has both 


x = e l 


y = e 


,3f 


,3f 


and 



(4) 


as solutions on any closed interval. 

We now give a brief sketch of the general theory of the linear system (2). It 
will be observed that this theory is very similar to that of the second order 
linear equation as described in Sections 14 and 15. We begin by stating the 
following fundamental existence and uniqueness theorem, whose proof is 
given in Chapter 13. 

Theorem A. If t 0 is any point of the interval [a, b], and ifx 0 and y 0 are any numbers 
whatever, then (2) has one and only one solution 


x = x(t) 

y=y(t), 


valid throughout [a, b], such that x(t 0 )-x 0 and y(t Q )=y 0 . 

Our next step is to study the structure of the solutions of the homogeneous 
system obtained from (2) by removing the terms/! (f) and/ 2 (f): 


— = a 1 (t)x + b 1 (t)y 


dt 


(5) 



. dt 


It is obvious that (5) is satisfied by the so-called trivial solution, in which x(t) 
and y(t) are both identically zero. Our main tool in constructing more useful 
solutions is the next theorem. 

Theorem B. If the homogeneous system (5) has two solutions 



and 



( 6 ) 


on [a, b], then 


x = cpe/f) + c 2 x 2 (t) 
y = c 1 y 1 (t)+c 2 y 2 (t) 


( 7 ) 


is also a solution on [a, b]for any constants c v and c 2 . 
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Proof. The proof is a routine verification, and is left to the reader. 

The solution (7) is obtained from the pair of solutions (6) by multiplying 
the first by c v the second by c 2 , and adding; (7) is therefore called a linear com¬ 
bination of the solutions (6). With this terminology, we can restate Theorem 
B as follows: any linear combination of two solutions of the homogeneous 
system (5) is also a solution. Accordingly, (3) has 

[ x = Cie 3f + c 2 e 2 ' 

[y = Cie 3f + 2 c 2 e 2 ' ® 

as a solution for every choice of the constants c x and c 2 . 

The next question we must settle is that of whether (7) contains all solu¬ 
tions of (5) on [a, b], that is, whether it is the general solution of (5) on [a, b]. 
By Theorem A, (7) will be the general solution if the constants c x and c 2 can 
be chosen so as to satisfy arbitrary conditions x(t i} ) = x 0 and y(t 0 )-y 0 at an 
arbitrary point t 0 in [a, b], or equivalently, if the system of linear algebraic 
equations 


CiXfQ + c 2 x 2 (f 0 ) — x 0 
c i 3 /i(^o) + c 2 }/ 2 (^ 0 ) = Vo 

in the unknowns c 1 and c 2 can be solved for each t 0 in [a, b] and every pair of 
numbers x 0 and y 0 . By the elementary theory of determinants, this is possible 
whenever the determinant of the coefficients. 


W(f) = 


Xi(t) 

yi(f) 


*2 (f) 

1/2(0 


does not vanish on the interval [a, b\. This determinant is called the Wronskian 
of the two solutions (6) (see Problem 4), and the above remarks prove the next 
theorem. 


Theorem C. If the tzvo solutions (6) of the homogeneous system (5) have a Wronskian 
W(t) that does not vanish on [a, b], then (7) is the general solution of (5) on this 
interval. 


It follows from this theorem that (8) is the general solution of (3) on any closed 
interval, for the Wronskian of the two solutions (4) is 


W(f) = 


e 2f 

2e 2f 


= e 5f . 
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which never vanishes. It is useful to know, as this example suggests, that the 
vanishing or nonvanishing of the Wronskian W(t) of two solutions does not 
depend on the choice of t. To state it formally, we have 

Theorem D. lfW(t) is the Wronskian of the two solutions (6) of the homogeneous 
system (5), then W(t) is either identically zero or nowhere zero on [a, b]. 

Proof. A simple calculation shows that W(t) satisfies the first order differen¬ 
tial equation 

dW 

^T = [ aft) + b 2 {t)]W, (9) 

dr 


from which it follows that 


W(t) = cJ Mt)+bm]dt (10) 

for some constant c. The conclusion of the theorem is now evident from the 
fact that the exponential factor in (10) never vanishes on [a, b]. 

Theorem C provides an adequate means of verifying that (7) is the general 
solution of (5): show that the Wronskian W(t) of the two solutions (6) does 
not vanish. We now develop an equivalent test that is often more direct and 
convenient. 

The two solutions (6) are called linearly dependent on [a, b] if one is a con¬ 
stant multiple of the other in the sense that 

Xi(t) = kx 2 (t) x 2 (t) = kxft) 

or 

yft) = ky 2 (t) y 2 (t) = kyft) 

for some constant k and all t in [a, b\, and linearly independent if neither is a 
constant multiple of the other. It is clear that linear dependence is equivalent 
to the condition that there exist two constants c l and c 2 , at least one of which 
is not zero, such that 


Cixdf) + c 2 x 2 (t) = 0 

( 11 ) 

ciyft) + c 2 y 2 (f) = 0 
for all t in [a, b\. We now have the next theorem. 

Theorem E. If the two solutions (6) of the homogeneous system (5) are linearly inde¬ 
pendent on [a, b], then (7) is the general solution of (5) on this interval. 

Proof. In view of Theorems C and D, it suffices to show that the solutions (6) 
are linearly dependent if and only if their Wronskian W(f) is identically zero. 
We begin by assuming that they are linearly dependent, so that, say. 
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Then 


x 1 (t) =kx 2 (t) 
yft) = ky 2 (t). 

kx 2 (t) x 2 (t) 
tyi(t) 1 / 2(0 
= kx 2 {t)y 2 (t) - kx 2 (t)y 2 (t) = 0 


W(t) = 


Xi(t) x 2 (t) 
yi (0 1/2(0 


( 12 ) 


for all t in [a, b\. The same argument works equally well if the constant k is on 
the other side of equations (12). We now assume that W(t) is identically zero, 
and show that the solutions (6) are linearly dependent in the sense of equa¬ 
tions (11). Let t 0 be a fixed point in [a, b\. Since W(f 0 ) = 0, the system of linear 
algebraic equations 

Cl*l(0) + C 2 X 2^o) ~ 0 
ciyi(0)+ c 2}/2(^0 ) = 0 

has a solution c, c 2 in which these numbers are not both zero. Thus, the solu¬ 
tion of (5) given by 

(x = c 1 x 1 (t) + c 2 x 2 (t) ^ 

1j/ = c 1 i/i(0 + c 2 i/ 2 (0 

equals the trivial solution at t 0 . It now follows from the uniqueness part of 
Theorem A that (13) must equal the trivial solution throughout the interval 
[a, b], so (11) holds and the proof is complete. 


The value of this test is that in specific problems it is usually a simple matter 
of inspection to decide whether two solutions of (5) are linearly independent 
or not. 

We now return to the nonhomogeneous system (2) and conclude our dis¬ 
cussion with 


Theorem F .If the two solutions (6) of the homogeneous system (5) are linearly inde¬ 
pendent on [a, b], and if 

jx = x v (t) 

I y = y P (t) 


is any particidar solution of (2) on this interval, then 

\x = CiXft) + c 2 x 2 (t) + x f ,(t) 
\y = c 1 y 1 (t) + c 2 y 2 (t) + y p (t) 


( 14 ) 


is the general solution of (2) on [a, b\. 
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Proof. It suffices to show that if 


X = x(t) 

y = y(0 


is an arbitrary solution of (2), then 


X = x(t)-x p (t) 

y = y(0-y P (0 


is a solution of (5), and this we leave to the reader. 

The above treatment of the linear system (2) shows how its general solution 
(14) can be built up out of simpler pieces. But how do we find these pieces? 
Unfortunately—as in the case of second order linear equations—there does 
not exist any general method that always works. In the next section we dis¬ 
cuss an important special case in which this problem can be solved: that in 
which the coefficients aft) and b,(t), i = 1, 2, are constants. 


Problems 

1. Prove Theorem B. 

2. Finish the proof of Theorem F. 

3. Verify equation (9). 

4. Let the second order linear equation 


d * 1 2 3 4 x dx 

^4 + P(t) — + Q(t)x = 0 

dt 2 w dt 


0 


be reduced to the system 


dx 


% = -Q{t)x-P{t)y. 
dt 


If x,(f) and x 2 (t) are solutions of equation (*), and if 



and 
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are the corresponding solutions of (**), show that the Wronskian of the 
former in the sense of Section 15 is precisely the Wronskian of the latter 
in the sense of this section. 


5. (a) Show that 



and 



are solutions of the homogeneous system 
dx 


dt 


= x + 3 y 


dy 
. dt 


3 x + y. 


(b) Show in two ways that the given solutions of the system in (a) are 
linearly independent on every closed interval, and write the gen¬ 
eral solution of this system. 

(c) Find the particular solution 

Jx = x(t) 

|y = y(t) 


of this system for which x(0) = 5 and t/(0) = 1. 
6. (a) Show that 


x = 2e 4f 

y = 3e 4f 


and 



are solutions of the homogeneous system 


dx 
dt 
dy 
. dt 


x + 2 y 
3x + 2 y. 


(b) Show in two ways that the given solutions of the system in (a) are 
linearly independent on every closed interval, and write the gen¬ 
eral solution of this system. 

(c) Show that 


x = 3t-2 
y = -2t + 3 
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is a particular solution of the nonhomogeneous system 

dx _ , „ 

— = x + 2y + f- l 
dt 

< 

— = 3x + 2y - 5f - 2, 


and write the general solution of this system. 

7. Obtain the given solutions of the homogeneous system in Problem 6 

(a) by differentiating the first equation with respect to t and eliminat¬ 
ing y; 

(b) by differentiating the second equation with respect to t and elimi¬ 
nating x. 

8. Use a method suggested by Problem 7 to find the general solution of the 
system 


dx 

dt 

dy 

. dt 


= x + 


= y- 


y 


9. (a) Find the general solution of the system 


dx 
dt 
dy_ 
. dt 


= x 

= y- 


(b) Show that any second order equation obtained from the system in 
(a) is not equivalent to this system, in the sense that it has solu¬ 
tions that are not part of any solution of the system. Thus, although 
higher order equations are equivalent to systems, the reverse is not 
true, and systems are more general. 


56 Homogeneous Linear Systems with Constant Coefficients 

We are now in a position to give a complete explicit solution of the simple system 
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where a y by a 2 , and b 2 are given constants. Some of the problems at the end 
of the previous section illustrate a procedure that can often be applied to this 
case: differentiate one equation, eliminate one of the dependent variables, 
and solve the resulting second order linear equation. The method we now 
describe is based instead on constructing a pair of linearly independent solu¬ 
tions directly from the given system. 

If we recall that the exponential function has the property that its deriva¬ 
tives are constant multiples of the function itself, then (just as in Section 17) 
it is natural to seek solutions of (1) having the form 


J x = Ae mt 
|y = Be mt . 


( 2 ) 


If we substitute (2) into (1) we get 

Arne mt =a 1 Ae mt +b 1 Be mt 
Bme m, =a 2 Ae mt + b 2 Be mt ; 

and dividing by e mt yields the linear algebraic system 

(% - m)A + b 2 B = 0 
a 2 A + ( b 2 - m)B = 0 


in the unknowns A and B. It is clear that (3) has the trivial solution A=B = 0, 
which makes (2) the trivial solution of (1). Since we are looking for nontrivial 
solutions of (1), this is no help at all. However, we know that (3) has non¬ 
trivial solutions whenever the determinant of the coefficients vanishes, i.e., 
whenever 


a 1 -m 

U2 


h 

b 2 -m 


= 0 . 


When this determinant is expanded, we get the quadratic equation 


m 2 - («j + b 2 )m + (a 1 b 2 - a 2 b 2 ) = 0 (4) 

for the unknown m. By analogy with our previous work, we call this the aux¬ 
iliary equation of the system (1). Let m t and m 2 be the roots of (4). If we replace 
m in (3) by m v then we know that the resulting equations have a nontrivial 
solution A 1 By so 


J x = A,e mt 
{y = B 1 e mt 


( 5 ) 
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is a nontrivial solution of the system (1). By proceeding similarly with m 2 , we 
find another nontrivial solution 



( 6 ) 


In order to make sure that we obtain two linearly independent solutions— 
and hence the general solution—it is necessary to examine in detail each of 
the three possibilities for m l and m 2 . 

Distinct real roots. When m 1 and m 2 are distinct real numbers, then (5) and 
(6) are easily seen to be linearly independent (why?) and 


x = CiA 1 e mt + c 2 A 2 e m2t 
y = c 1 B 1 e mt + c 2 B 2 e m2t 


(7) 


is the general solution of (1). 


Example 1. In the case of the system 


. at 



( 8 ) 


(3) is 


(1 - m)A+B=0 


(9) 


4A + (-2 - m) B = 0. 


The auxiliary equation here is 


m 2 +m- 6 = 0 or (m + 3 )(m - 2) = 0, 


so m 1 and m 2 are -3 and 2. With m=- 3, (9) becomes 


4A + B = 0 


4A + B = 0. 


A simple nontrivial solution of this system is A = 1, B =-4, so we have 



( 10 ) 
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as a nontrivial solution of (8). With m = 2, (9) becomes 


-A + B = 0 


4A - 4B = 0, 


and a simple nontrivial solution is A = 1, B = 1. This yields 



( 11 ) 


as another solution of (8); and since it is clear that (10) and (11) are linearly 
independent. 


\x = c 1 e 31 + c 2 e 2t 
{y = -4c 1 e“ 3, +c 2 e 2 ' 


is the general solution of (8). 


Distinct complex roots. If m , and m 2 are distinct complex numbers, then 
they can be written in the form a ± ib where a and b are real numbers and 
b / 0. In this case we expect the A's and B's obtained from (3) to be complex 
numbers, and we have two linearly independent solutions 

jx = AW*** , jx = A;e (a - ii)t 

{ y = B\e (a+ib)t an { y = Bk ( *~ ft) ' . 

However, these are complex-valued solutions, and to extract real-valued 
solutions we proceed as follows. If we express the numbers Al and in the 
standard form A" t = A, + iA 2 and B t = B| + zB 2 , and use Euler's formula 17-(7), 
then the first of the solutions (13) can be written as 


jx = (Ai + iA 2 )e at (cos bt + i sin bt) 
[ y = (Bi + iB 2 )e at (cos bt + istnbt) 


or 


x = e at [(A 1 cos bt - A 2 sin bt) + i(A\ sin bt + A 2 cos bt)] 
y = e at [(B! cos bt - B 2 sin bt) + i(Bi sin bt + B 2 cos bt)]. 


( 14 ) 
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It is easy to see that if a pair of complex-valued functions is a solution of (1), 
in which the coefficients are real constants, then their two real parts and their 
two imaginary parts are real-valued solutions. It follows from this that (14) 
yields the two real-valued solutions 


jx - e at (A 1 cos bt - A 2 sin bt) 
[ y = e at (Bi cos bt - B 2 sin bt) 


and 


jx = e at {Ai sin bt + A 2 cos bt) 
jy = e at (B 1 sin bt + B 2 cos bt). 


(15) 


(16) 


It can be shown that these solutions are linearly independent (we ask the 
reader to prove this in Problem 3), so the general solution in this case is 

jx = e at [ci {Ai cos bt - A 2 sin bt) + c 2 (A 1 sin bt + A 2 cos bt)] 

| y = e‘“[c 1 (B 1 cos bt-B 2 sin bt) + c 2 (B 2 sin bt + B 2 cos bt)]. 

Since we have already found the general solution, it is not necessary to con¬ 
sider the second of the two solutions (13). 


Equal real roots. When m t and m 2 have the same value m, then (5) and (6) are 
not linearly independent and we essentially have only one solution 


x = Ae mt 
y = Be mt . 


(18) 


Our experience in Section 17 would lead us to expect a second linearly inde¬ 
pendent solution of the form 


jx = Ate mt 
jy = Bte ml . 

Unfortunately the matter is not quite as simple as this, and we must actually 
look for a second solution of the form 


jx = (Aj + A 2 t)e mt 

{y = (Bi + B 2 t)e mt , 


( 19 ) 
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so that the general solution is 

| x = c 1 Ae mt +c 2 (A 1 + A 2 t)e ml 

\y = c 1 Be m, +c 2 {B 1 + B 2 t)e mt . 2 ( ’ 

The constants A v A 2 , B v and B 2 are found by substituting (19) into the system 
(1). Instead of trying to carry this through in the general case, we illustrate 
the method by showing how it works in a simple example. 


Example 2. In the case of the system 

[— = 3x - 4y 
_ dt y 

di/ 

[dt J 

(3) is 

(3 - m)A - 4B = 0 
A + (-l-m)B = 0. 

The auxiliary equation is 

m 2 - 2m + 1 = 0 or (m - l) 2 = 0, 

which has equal real roots 1 and 1. With m = 1, (22) becomes 

2h-4B = 0 
A-2B = 0. 

A simple nontrivial solution of this system is A = 2, B = 1, so 

jx = 2e t 

h = e l 


( 21 ) 


( 22 ) 


(23) 


2 The only exception to this statement occurs when a 1 = b 2 =a and ci 2 = b 1 = 0, so that the auxiliary 
equation is m 2 - 2am+a 2 = 0, m = a, and the constants A and B in (18) are completely unre¬ 
stricted. In this case the general solution of (1) is obviously 

fx = c 1 e ml 
\y = c 2 e m ‘. 


and the system is said to be uncoupled (since each equation can be solved independently of the 
other). 
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is a nontrivial solution of (21). We now seek a second linearly indepen¬ 
dent solution of the form 

\x = (Ai+A 2 t)e‘ 

[y = (B 1 +B 2 t)e‘. 

When this is substituted into (21), we obtain 

(Aj + A 2 t + A 2 )e‘ = 3(Ai + A 2 t)e* - 4(B 2 + B 2 t)e‘ 

(Bi + B 2 t + B 2 )e ( — (Ai + A 2 f)t ,t — (Bj + B 2 f)c ( , 

which reduces at once to 

(2 A 2 -4B 2 )t + (2A 1 -A 2 -4B 1 ) = 0 
(A 2 -2B,)f + {Ax -2 Bx-B 2 ) = 0. 

Since these are to be identities in the variable t, we must have 

2A 2 - 4B 2 = 0 2Ax - A 2 - 4 Bx = 0 
A 2 - 2B 2 = 0, Ax - 2Bx -B 2 =0. 


The two equations on the left have A 2 = 2, B 2 = 1 as a simple nontrivial 
solution. With this, the two equations on the right become 

2A 1 -4B 1 = 2 

Ax- 2Bj = l, 

so we may take A t = 1, = 0. We now insert these numbers into (24) and 

obtain 


jx = (l + 2t)e‘ 

\y=te‘ 


as our second solution. It is obvious that (23) and (25) are linearly inde¬ 
pendent, so 


jx = 2 Cxe‘ + c 2 (l + 2t)e* 
[i/ = Cie f + c 2 te‘ 


is the general solution of the system (21). 


Systems of First Order Equations 


505 


Problems 

1. Use the methods described in this section to find the general solution 
of each of the following systems: 


(a) 


(b) 


(c) 


(d) 


(e) 


(f) 


dx 

— = -3x + 4 y 
dt 


dy 

dt 

dx 

dt 

dy 

dt 

dx 

dt 


= -2x + 3y; 
= 4x - 2y 
= 5x + 2y; 
= 5x + 4y 


dy 

— = -x + y; 

dt J 


dx 

It 

dy 

dt 

dx 

It 

dy 

dt 

dx 

dt 


= 4x-3y 
= 8x-6y; 
= 2x 

= 3 y; 

= -4x-y 


(g) 

(h) 


dy 
. dt 
dx 
dt 

. dt 
dx 
dt 


x-2 y; 
7x + 6y 
2x + 6y; 
-~x-2y 


— = 4x + 5y. 

[dt J 

2. Show that the condition a 1 b ] > 0 is sufficient, but not necessary, for the 
system (1) to have two real-valued linearly independent solutions of the 
form (2). 
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3. Show that the Wronskian of the two solutions (15) and (16) is given by 
tV(0 = (A 1 B 2 - A 2 Bfe 2at , 


and prove that A 1 B 2 - A 2 B 1 / 0. 

4. Show that in formula (20) the constants A 2 and B 2 satisfy the same lin¬ 
ear algebraic system as the constants A and B, and that consequently 
we may put A 2 =A and B 2 = B without any loss of generality 

5. Consider the nonhomogeneous linear system 


— = a 1 (t)x + bft)y + f 1 (t) 
at 

^- = a 2 (t)x + b 2 (t)y + f 2 (t) 
at 


n 


and the corresponding homogeneous system 
dx 

= a 1 (t)x + b 1 (t) V 

d ^- = a 2 (t)x + b 1 {t)y. 

L at 

(a) If 

fx = Xi (t) [ x = x 2 (t) 

\ ) and \ 

[y = yi(0 [y = yi{t) 


are linearly independent solutions of (**), so that 

fx = C 1 Xi(t) + C 2 X 2 (t) 

|y = ciyi(t) + c 2 y 2 (t) 

is its general solution, show that 

Jx = Vi(t)Xi(t) + v 2 (t)x 2 (t) 

\y = v 1 (t)y 1 (t) + v 2 (t)y 2 (t) 

will be a particular solution of (*) if the functions vft) and v 2 (t) sat¬ 
isfy the system 

v\x x + » 2 x 2 = f\ 
v' l y 1 + v' 2 y 2 = f 2 . 


This technique for finding particular solutions of nonhomogeneous 
linear systems is called the method of variation of parameters. 
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(b) Apply the method outlined in (a) to find a particular solution of the 
nonhomogeneous system 

— = x + t/-5f + 2 
dt J 

< 

— = 4x-2i/-8f-8, 

[dt 

whose corresponding homogeneous system is solved in Example 1. 


57 Nonlinear Systems. Volterra's Prey-Predator Equations 

Everyone knows that there is a constant struggle for survival among differ¬ 
ent species of animals living in the same environment. One kind of animal 
survives by eating another; a second, by developing methods of evasion to 
avoid being eaten; and so on. 

As a simple example of this universal conflict between the predator and 
its prey, let us imagine an island inhabited by foxes and rabbits. The foxes 
eat rabbits, and the rabbits eat clover. We assume that there is so much clo¬ 
ver that the rabbits always have an ample supply of food. When the rabbits 
are abundant, then the foxes flourish and their population grows. When the 
foxes become too numerous and eat too many rabbits, they enter a period 
of famine and their population begins to decline. As the foxes decrease, the 
rabbits become relatively safe and their population starts to increase again. 
This triggers a new increase in the fox population, and as time goes on we 
see an endlessly repeated cycle of interrelated increases and decreases in the 
populations of the two species. These fluctuations are represented graphi¬ 
cally in Figure 70, where the sizes of the populations are plotted against time. 



FIGURE 70 
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Problems of this kind have been studied by both mathematicians and biol¬ 
ogists, and it is quite interesting to see how the mathematical conclusions we 
shall develop confirm and extend the intuitive ideas arrived at in the preced¬ 
ing paragraph. In discussing the interaction between the foxes and the rab¬ 
bits, we shall follow the approach of Volterra, who initiated the quantitative 
treatment of such problems. 3 

If x is the number of rabbits at time f, then we should have 

— = ax, a> 0, 
dt 

as a consequence of the unlimited supply of clover, if the number y of foxes 
is zero. It is natural to assume that the number of encounters per unit time 
between rabbits and foxes is jointly proportional to x and y. If we further 
assume that a certain proportion of these encounters result in a rabbit being 
eaten, then we have 


dx , 

— = ax - bxy, 
dt 


a and b > 0. 


In the same way 


dy_ 


dt 


-cy+dxy, 


c and d > 0; 


for in the absence of rabbits the foxes die out, and their increase depends on 
the number of their encounters with rabbits. We therefore have the following 
nonlinear system describing the interaction of these two species: 


dx 

dt 

. dt 


x(a-by) 

-y(c-dx). 


( 1 ) 


3 Vito Volterra (1860-1940) was an eminent Italian mathematician. His early work on integral 
equations (together with that of Fredholm and Hilbert) began the full-scale development of 
linear analysis that dominated so much of mathematics during the first half of the twentieth 
century. His vigorous excursions in later life into mathematical biology enriched both mathe¬ 
matics and biology. For further details, see his Lecons sur la theorie mathematique de la lutte pour 
la vie, Gauthier-Villars, Paris, 1931; or A. J. Lotka, Elements of Mathematical Biology, pp. 88-94, 
Dover, New York, 1956. A modern discussion, with the Hudson's Bay Company data on the 
numbers of lynx and hares in Canada from 1847 to 1903, can be found in E. R. Leigh, "The 
Ecological Role of Volterra's Equations," in Some Mathematical Problems in Biology, American 
Mathematical Society, Providence, R.I., 1968. 
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Equations (1) are called Volterra's prey-predator equations. Unfortunately this 
system cannot be solved in terms of elementary functions. On the other 
hand, if we think of its unknown solution 

Jx = x(t) 

|y = y(f) 

as constituting the parametric equations of a curve in the xty-plane, then we 
can find the rectangular equation of this curve. On eliminating t in (1) by 
division, and separating the variables, we obtain 

(a-by)dy _ ( c-dx)dx 
y x 


Integration now yields 

a log y -by = -c log x+dx + log K 


or 


ya e -by _ Kx~ c e dx , (2) 

where the constant K is given by 

K = x c 0 y a 0 e- dx °- hvo 

in terms of the initial values of x and y. 

Although we cannot solve (2) for either x or y, we can determine points on 
the curve by an ingenious method due to Volterra. To do this, we equate the 
left and right sides of (2) to new variables z and w, and then plot the graphs 
C, and C 2 of the functions 


z -ya e -by and w-Kxr c e ix (3) 

as shown in Figure 71. Since z-vo, we are confined in the third quadrant to 
the dotted line L. To the maximum value of z given by the point A on C v 
there corresponds one y and—via M on L and the corresponding points A 
and A" on C 2 —two x's, and these determine the bounds between which x 
may vary. Similarly, the minimum value of w given by B on C 2 leads to N on 
L and hence to B' and B" on C, and these points determine the bounds for 
y. In this way we find the points P, P 2 and Q v Q 2 on the desired curve C 3 . 
Additional points are easily found by starting on L at a point R anywhere 
between M and N and projecting up to Cj and over to C 3 , and then over to 
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FIGURE 71 


C 2 and up to C 3 , as indicated in Figure 71. It is clear that changing the value 
of K raises or lowers the point B, and this expands or contracts the curve C 3 . 
Accordingly, when K is given various values, we obtain a family of ovals 
about the point S, which is all there is of C 3 when the minimum value of w 
equals the maximum value of z. 

We next show that as t increases, the corresponding point (x, y) on C 3 moves 
around the curve in a counterclockwise direction. To see this, we begin by 
noting that equations (1) give the horizontal and vertical components of the 
velocity of this point. A simple calculation based on formulas (3) shows that 
the point S has coordinates x-c/d, y=a/b. When x < c/d, it follows from the 
second equation of (1) that dy/dt is negative, so our point on C 3 moves down 
as it traverses the arc Q 2 P 2 Q V Similarly, it moves up along the arc Q 1 P 2 Q 2 , so 
the assertion is proved. 

Finally, we use the fox-rabbit problem to illustrate the important method of 
linearization. First, we observe that if the rabbit and fox populations are 

x = ^ and y= a (4) 

d b 

then the system (1) is satisfied and we have dx/dt = 0 and dy/dt - 0, so there are 
no increases or decreases in x or y. The populations (4) are called equilibrium 
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popidations, for x and y can maintain themselves indefinitely at these con¬ 
stant levels. It is obvious that this is the special case in which the minimum 
of w equals the maximum of z, so that the oval C 3 reduces to the point S. If we 
now return to the general case and put 

x = - + X and y = — + Y, 
d 

then X and Y can be thought of as the deviations of x and y from their equi¬ 
librium values. An easy calculation shows that if x and y in (1) are replaced 
by X and Y [which amounts to translating the point (c/d, a/b) to the origin] 
then (1) becomes 


' dX 

hr 


- — Y-bXY 


d 

dY 

ad 


— X + dXY. 

dt 

b 


(5) 


We now "linearize" by assuming that if X and Y are small, then the XY terms 
in (5) can be discarded without serious error. This assumption amounts to 
little more than a hope, but it does simplify (5) to a linear system 


dX _ be y 
dt d 
dY _ ad 


( 6 ) 


It is easy to find the general solution of (6), but it is even easier to eliminate t 
by division and obtain 


dY _ ad 2 X 
dX~ IfcY' 


whose solution is immediately seen to be 


ad 2 X 2 +b 2 cY 2 =C 2 . 


This is a family of ellipses surrounding the origin in the XY-plane. Since 
ellipses are qualitatively similar to the ovals of Figure 71, we have reasonable 
grounds for hoping that (6) is an acceptable approximation to (5). 

We trust that the reader agrees that the fox-rabbit problem is interesting for 
its own sake. Beyond this, however, we have come to appreciate the fact that 
nonlinear systems present us with problems of a different nature from those 
we have considered before. In studying a system like (1), we have learned to 
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direct our attention to the behavior of solutions near points in the xy-plane at 
which the right sides both vanish; we have seen why periodic solutions (i.e., 
those that yield simple closed curves like C 3 in Figure 71) are important and 
desirable; and we have a hint of a method for studying nonlinear systems by 
means of linear systems that approximate them. In the next chapter we shall 
study nonlinear systems more fully, and each of these themes will be worked 
out in greater detail and generality. 


Problems 

1. Eliminate y from the system (1) and obtain the nonlinear second order 
equation satisfied by the function x(t). 

2. Show that d * 1 2 y/dt 2 > 0 whenever dx/dt > 0. What is the meaning of this 
result in terms of Figure. 70? 




Chapter 11 

Nonlinear Equations 


58 Autonomous Systems. The Phase Plane and Its Phenomena 

There have been two major trends in the historical development of differ¬ 
ential equations. The first and oldest is characterized by attempts to find 
explicit solutions, either in closed form—which is rarely possible—or in 
terms of power series. In the second, one abandons all hope of solving equa¬ 
tions in any traditional sense, and instead concentrates on a search for quali¬ 
tative information about the general behavior of solutions. We applied this 
point of view to linear equations in Chapter 4. The qualitative theory of non¬ 
linear equations is totally different. It was founded by Poincare around 1880, 
in connection with his work in celestial mechanics, and since that time has 
been the object of steadily increasing interest on the part of both pure and 
applied mathematicians. 1 

The theory of linear differential equations has been studied deeply and 
extensively for the past 200 years, and is a fairly complete and well-rounded 
body of knowledge. However, very little of a general nature is known about 
nonlinear equations. Our purpose in this chapter is to survey some of the 
central ideas and methods of this subject, and also to demonstrate that it 
presents a wide variety of interesting and distinctive new phenomena that 
do not appear in the linear theory. The reader will be surprised to find that 
most of these phenomena can be treated quite easily without the aid of 
sophisticated mathematical machinery, and in fact require little more than 
elementary differential equations and two-dimensional vector algebra. 

Why should one be interested in nonlinear differential equations? The 
basic reason is that many physical systems—and the equations that describe 
them—are simply nonlinear from the outset. The usual linearizations are 
approximating devices that are partly confessions of defeat in the face of the 
original nonlinear problems and partly expressions of the practical view that 
half a loaf is better than none. It should be added at once that there are many 
physical situations in which a linear approximation is valuable and adequate 


1 See Appendix A for a general account of Poincare's work in mathematics and science. 
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for most purposes. This does not alter the fact that in many other situations 
linearization is unjustified. 2 

It is quite easy to give simple examples of problems that are essentially 
nonlinear. For instance, if x is the angle of deviation of an undamped pen¬ 
dulum of length a whose bob has mass m, then we saw in Section 5 that its 
equation of motion is 


d 2 x 

dt 2 


+ —sinx 
a 


0 ; 


( 1 ) 


and if there is present a damping force proportional to the velocity of the 
bob, then the equation becomes 


d 2 x 
dt 2 


+ 


c dx 

-h 

m dt 


Q 

— sin x = 0. 
a 


( 2 ) 


In the usual linearization we replace sin x by x, which is reasonable for small 
oscillations but amounts to a gross distortion when x is large. An example of 
a different type can be found in the theory of the vacuum tube, which leads 
to the important van der Pol equation 

d x , t .. dx n . 

— r +p(x -1)—+ x = 0. (3) 

dr dt 

It will be seen later that each of these nonlinear equations has interesting 
properties not shared by the others. 

Throughout this chapter we shall be concerned with second order nonlin¬ 
ear equations of the form 


d 2 x 

It 2 



(4) 


which includes equations (1), (2), and (3) as special cases. If we imagine a 
simple dynamical system consisting of a particle of unit mass moving on 
the x-axis, and if/(x, dx/dt) is the force acting on it, then (4) is the equation of 
motion. The values of x (position) and dx/dt (velocity), which at each instant 
characterize the state of the system, are called its phases, and the plane of the 
variables x and dx/dt is called the phase plane. If we introduce the variable 
y = dx/dt, then (4) can be replaced by the equivalent system 


2 It has even been suggested by Einstein that since the basic equations of physics are nonlinear, 
all of mathematical physics will have to be done over again. If his crystal ball was clear on 
the day he said this, the mathematics of the future will certainly be very different from that 
of the past and present. 
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dx 



(5) 


We shall see that a good deal can be learned about the solutions of (4) 
by studying the solutions of (5). When t is regarded as a parameter, then 
in general a solution of (5) is a pair of functions x(t ) and y(t) defining a 
curve in the xy-plane, which is simply the phase plane mentioned above. 
We shall be interested in the total picture formed by these curves in the 
phase plane. 

More generally, we study systems of the form 



( 6 ) 


where F and G are continuous and have continuous first partial derivatives 
throughout the plane. A system of this kind, in which the independent vari¬ 
able t does not appear in the functions F and G on the right, is said to be 
autonomous. We now turn to a closer examination of the solutions of such a 
system. 

It follows from our assumptions and Theorem 54-A that if t Q is any num¬ 
ber and {x 0 ,y 0 ) is any point in the phase plane, then there exists a unique 
solution 



(7) 


of (6) such that x(t 0 )-x 0 and y(t 0 )-y 0 . If x(f) and y(t ) are not both constant 
functions, then (7) defines a curve in the phase plane called a path of the sys¬ 
tem. 3 It is clear that if (7) is a solution of (6), then 


X = x(t + c) 

y = y(t+c) 


( 8 ) 


is also a solution for any constant c. Thus each path is represented by 
many solutions, which differ from one another only by a translation of 


3 The terms trajectory and characteristic are used by some writers. 
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the parameter. Also, it is quite easy to prove (see Problem 2) that any path 
through the point (x 0 ,y Q ) must correspond to a solution of the form (8). It fol¬ 
lows from this that at most one path passes through each point of the phase 
plane. Furthermore, the direction of increasing t along a given path is the 
same for all solutions representing the path. A path is therefore a directed 
curve, and in our figures we shall use arrows to indicate the direction in 
which the path is traced out as t increases. 

The above remarks show that in general the paths of (6) cover the entire 
phase plane and do not intersect one another. The only exceptions to this 
statement occur at points (x 0 ,y 0 ) where both F and G vanish: 

Hx 0 ,y 0 ) = 0 and G(x 0 ,y 0 ) = 0. 

These points are called critical points, and at such a point the unique solu¬ 
tion guaranteed by Theorem 54-A is the constant solution x = x 0 and y = y 0 . 
A constant solution does not define a path, and therefore no path goes 
through a critical point. In our work we will always assume that each critical 
point (x 0 ,y 0 ) is isolated, in the sense that there exists a circle centered on (x 0 ,y 0 ) 
that contains no other critical point. 

In order to obtain a physical interpretation of critical points, let us consider 
the special autonomous system (5) arising from the dynamical equation (4). 
In this case a critical point is a point (x 0 ,0) at which y = 0 and f(x 0 , 0) = 0; that 
is, it corresponds to a state of the particle's motion in which both the velocity 
dx/dt and the acceleration dy/dt = d 2 x/dt 2 vanish. This means that the particle 
is at rest with no force acting on it, and is therefore in a state of equilibrium. 4 
It is obvious that the states of equilibrium of a physical system are among its 
most important features, and this accounts in part for our interest in critical 
points. 

The general autonomous system (6) does not necessarily arise from any 
dynamical equation of the form (4). What sort of physical meaning can be 
attached to the paths and critical points in this case? Here it is convenient to 
consider Figure 72 and the two-dimensional vector field defined by 

V(x,y) = F(x,y) i + G(x,y)j, 

which at a typical point P = ( x,y ) has horizontal component F(x,y) and verti¬ 
cal component G(x,y). Since dx/dt = F and dy/dt = G, this vector is tangent to 
the path at P and points in the direction of increasing t. If we think of t as 
time, then V can be interpreted as the velocity vector of a particle moving 
along the path. We can also imagine that the entire phase plane is filled with 
particles, and that each path is the trail of a moving particle preceded and 
followed by many others on the same path and accompanied by yet others 
on nearby paths. This situation can be described as a two-dimensional fluid 


4 For this reason, some writers use the term equilibrium point instead of critical point. 
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FIGURE 72 


motion ; and since the system (6) is autonomous, which means that the vector 
V(x,y) at a fixed point (x,y) does not change with time, the fluid motion is sta¬ 
tionary. The paths are the trajectories of the moving particles, and the critical 
points Q, R, and S are points of zero velocity where the particles are at rest 
(i.e., stagnation points of the fluid motion). 

The most striking features of the fluid motion illustrated in Figure 72 are: 

(a) the critical points; 

(b) the arrangement of the paths near critical points; 

(c) the stability or instability of critical points, that is, whether a particle 
near such a point remains near or wanders off into another part of 
the plane; 

(d) closed paths (like C in the figure), which correspond to periodic 
solutions. 

These features constitute a major part of the phase portrait (or overall pic¬ 
ture of the paths) of the system (6). Since in general nonlinear equations and 
systems cannot be solved explicitly, the purpose of the qualitative theory 
discussed in this chapter is to discover as much as possible about the phase 
portrait directly from the functions F and G. To gain some insight into the 
sort of information we might hope to obtain, observe that if x(t) is a periodic 
solution of the dynamical equation (4), then its derivative y(t) = dx/dt is also 
periodic and the corresponding path of the system (5) is therefore closed. 
Conversely, if any path of (5) is closed, then (4) has a periodic solution. As a 
concrete example of the application of this idea, we point out that the van der 
Pol equation—which cannot be solved—can nevertheless be shown to have a 
unique periodic solution (if g > 0) by showing that its equivalent autonomous 
system has a unique closed path. 
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Problems 

1. Derive equation (2) by applying Newton's second law of motion to the 
bob of the pendulum. 

2. Let (x 0 ,y 0 ) be a point in the phase plane. If xft) yft) and x 2 (t), y 2 (t) are 
solutions of (6) such that xftf = x 0l yft^) -y 0 and x 2 (f 2 ) =x 0 , y 2 (t 2 )-y 0 for 
suitable t 1 and f 2 , show that there exists a constant c such that 

xft + c) = x 2 (t) and y 1 (t+c)=y 2 (t). 

3. Describe the relation between the phase portraits of the systems 


f eF <^> 


and 




4. Describe the phase portrait of each of the following systems: 
dx 


(a) 


, =0 
dt 

— = 0 ; 

dt 


(b) 


dx 

— = x 

dt 

— = 0 ; 

dt 


(c) 


= 1 


dx 
dt 

o. 

dt 


(d) 


dx 

dt 

dy_ 

dt 


- = -x 


= -y- 


5. The critical points and paths of equation (4) are by definition those of 
the equivalent system (5). Find the critical points of equations (1), (2), 
and (3). 
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6. Find the critical points of 



dt 



7. Find all solutions of the nonautonomous system 


dx 

dt 


= x 


dy_ 
. dt 


and sketch (in the xy-plane) some of the curves defined by these 
solutions. 


59 Types of Critical Points. Stability 

Consider an autonomous system 



( 1 ) 


We assume, as usual, that the functions F and G are continuous and have 
continuous first partial derivatives throughout the xy-plane. The critical 
points of (1) can be found, at least in principle, by solving the simultane¬ 
ous equations F(x,y) = 0 and G(x,y) = 0. There are four simple types of criti¬ 
cal points that occur quite frequently, and our purpose in this section is to 
describe them in terms of the configurations of nearby paths. First, however, 
we need two definitions. 

Let (x 0 ,y 0 ) be an isolated critical point of (1). If C = [x(f),y(f)] is a path of (1), 
then we say that C approaches (x 0 ,y 0 ) as t °o if 
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limx(f) = x 0 and limy(f) = y 0 . 5 (2) 

t-> 00 t-> CO 

Geometrically, this means that if P = (x,y) is a point that traces out C in accor¬ 
dance with the equations x = x(t) and y = y(t), then P -* (x 0 ,y 0 ) as t -*■ If it is 
also true that 


x(f)-x 0 


(3) 


exists, or if the quotient in (3) becomes either positively or negatively infi¬ 
nite as t -*■ °°, then we say that C enters the critical point (x 0 ,y 0 ) as t -> 
The quotient in (3) is the slope of the line joining (x 0 ,y 0 ) and the point P with 
coordinates x(t) and y(f), so the additional requirement means that this line 
approaches a definite direction as t -* In the above definitions, we may also 
consider limits as t -* It is clear that these properties are properties of the 
path C, and do not depend on which solution is used to represent this path. 

It is sometimes possible to find explicit solutions of the system (1), and 
these solutions can then be used to determine the paths. In most cases, how¬ 
ever, to find the paths it is necessary to eliminate t between the two equa¬ 
tions of the system, which yields 

dy _ G(x,y) (4) 

dx F(x,y) 

This first order equation gives the slope of the tangent to the path of (1) that 
passes through the point (x,y), provided that the functions F and G are not 
both zero at this point. In this case, of course, the point is a critical point 
and no path passes through it. The paths of (1) therefore coincide with the 
one-parameter family of integral curves of (4), and this family can often be 
obtained by the methods of Chapter 2. It should be noted, however, that 
while the paths of (1) are directed curves, the integral curves of (4) have no 
direction associated with them. Each of these techniques for determining the 
paths will be illustrated in the examples below. 

We now give geometric descriptions of the four main types of critical 
points. In each case we assume that the critical point under discussion is the 
origin O = (0,0). 


Nodes. A critical point like that in Figure 73 is called a node. Such a point is 
approached and also entered by each path as t -* °° (or as t -> -°°). For the 
node shown in Figure 73, there are four half-line paths, AO, BO, CO, and 
DO, which together with the origin make up the lines AB and CD. All other 


5 It can be proved that if (2) is true for some solution x(t), y(t), then ( x 0 , y 0 ) is necessarily a critical 
point. See F. G. Tricomi, Differential Equations, p. 47, Blackie, Glasgow, 1961. 
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FIGURE 73 


paths resemble parts of parabolas, and as each of these paths approaches O 
its slope approaches that of the line AB. 


Example 1. Consider the system 

dx 
dt 
dy_ 
_ dt 


= X 

= -x + 2 y. 


(5) 


It is clear that the origin is the only critical point, and the general solu¬ 
tion can be found quite easily by the methods of Section 56: 


X = CjC 

y = c 2 e l + c 2 e 2t 


( 6 ) 


When c 1 = 0, we have x = 0 and y = c 2 e 2t . In this case the path (Figure 74) is 
the positive y-axis when c 2 > 0, and the negative y-axis when c 2 < 0, and 
each path approaches and enters the origin as t -> When c 2 = 0, we 
have x=c l e t and y=c 2 e l . This path is the half-line y=x, x> 0, when c x > 0, 
and the half-line y = x, x < 0, when c 2 < 0, and again both paths approach 
and enter the origin as t -> When both c 2 and c 2 are ^ 0, the paths 
lie on the parabolas y = x + (c 2 /c 2 )x 2 , which go through the origin with 
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FIGURE 74 


slope 1. It should be understood that each of these paths consists of only 
part of a parabola, the part with x > 0 if c 1 > 0, and the part with x < 0 
if c 1 < 0. Each of these paths also approaches and enters the origin as 
t —> -o°; this can be seen at once from (6). If we proceed directly from (5) 
to the differential equation 


dy _ -x+ 2y ' 
dx x 

giving the slope of the tangent to the path through ( x,y ) [provided ( x,y ) 
^ (0,0)], then on solving (7) as a homogeneous equation, we find that 
y=x + cx 2 . This procedure yields the curves on which the paths lie (except 
those on the \j axis), but gives no information about the manner in which 
the paths are traced out. It is clear from this discussion that the critical 
point (0,0) of the system (5) is a node. 


Saddle points. A critical point like that in Figure 75 is called a saddle point. It 
is approached and entered by two half-line paths AO and BO as t -> and 
these two paths lie on a line AB. It is also approached and entered by two 
half-line paths CO and DO at t -» -°°, and these two paths lie on another line 
CD. Between the four half-line paths there are four regions, and each con¬ 
tains a family of paths resembling hyperbolas. These paths do not approach 
O as t -> °° or as t -> but instead are asymptotic to one or another of the 
half-line paths as t °o and as t 
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FIGURE 75 


Centers. A center (sometimes called a vortex) is a critical point that is sur¬ 
rounded by a family of closed paths. It is not approached by any path as 
t —» °° or as t —> — 


Example 2. The system 


dx 


dy_ 

_ dt 


=-y 


= X 


( 8 ) 


has the origin as its only critical point, and its general solution is 


[ x = -Ci sin t + c 2 cos t 
[y = CiCOSf + C 2 sinf. 


(9) 


The solution satisfying the conditions x(0) = 1 and y(0) - 0 is clearly 


x = cos t 
y = sinf; 


( 10 ) 
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and the solution determined by x(0) = 0 and y(0) = -1 is 


x = sinf = cos| t- — 


y = —cost = sin| f- — 


( 11 ) 


These two different solutions define the same path C (Figure 76), which 
is evidently the circle x 2 +y 2 = l. Both (10) and (11) show that this path is 
traced out in the counterclockwise direction. If we eliminate t between 
the equations of the system, we get 

dy _ x 
dx y 

whose general solution x 2 +y 2 =c 2 yields all the paths (but without their 
directions). It is obvious that the critical point (0,0) of the system (8) is a 
center. 


Spirals. A critical point like that in Figure 77 is called a spiral (or sometimes 
a focus). Such a point is approached in a spiral-like manner by a family of 
paths that wind around it an infinite number of times as t -* °° (or as t -* 



FIGURE 76 
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FIGURE 77 


Note particularly that while the paths approach O, they do not enter it. That 
is, a point P moving along such a path approaches O as t -* °° (or as t -> 
but the line OP does not approach any definite direction. 


Example 3. If a is an arbitrary constant, then the system 


dx 

— = ax-y 
dt 

dy 

—^ = x +mi 
[dt J 


( 12 ) 


has the origin as its only critical point (why?). The differential equation 
of the paths. 


dy _x +ay 
dx ax-y 


(13) 


is most easily solved by introducing polar coordinates r and 0 defined by 
x = r cos 0 and y = r sin 0. Since 

r 2 =x 2 + y 2 and 0 = tan“ 1 ^, 

X 
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we see that 


dr 
r — 
dx 


x + y 


dy_ 

dx 


and 


2 dO 

r — 
dx 



-y- 


With the aid of these equations, (13) can easily be written in the very 
simple form 


dr 

ho 


= ar, 


so 


r = ce a6 (14) 

is the polar equation of the paths. The two possible spiral configurations 
are shown in Figure 78 and the direction in which these paths are tra¬ 
versed can be seen from the fact that dx/dt = -y when x = 0. If a = 0, then 
(12) collapses to (8) and (14) becomes r = c, which is the polar equation of 
the family x 2 +y 2 =c 2 of all circles centered on the origin. This example 
therefore generalizes Example 2; and since the center shown in Figure 76 
stands on the borderline between the spirals of Figure 78, a critical point 
that is a center is often called a borderline case. We will encounter other 
borderline cases in the next section. 


We now introduce the concept of stability as it applies to the critical points of 
the system (1). 

It was pointed out in the previous section that one of the most important 
questions in the study of a physical system is that of its steady states. However, 
a steady state has little physical significance unless it has a reasonable degree 



y 



X 


a < 0 


FIGURE 78 
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m • 


FIGURE 79 


of permanence, i.e., unless it is stable. As a simple example, consider the pen¬ 
dulum of Figure 79. There are two steady states possible here: when the bob 
is at rest at the highest point, and when the bob is at rest at the lowest point. 
The first state is clearly unstable, and the second is stable. We now recall 
that a steady state of a simple physical system corresponds to an equilibrium 
point (or critical point) in the phase plane. These considerations suggest in a 
general way that a small disturbance at an unstable equilibrium point leads 
to a larger and larger departure from this point, while the opposite is true at 
a stable equilibrium point. 

We now formulate these intuitive ideas in a more precise way. Consider 
an isolated critical point of the system (1), and assume for the sake of con¬ 
venience that this point is located at the origin O = (0,0) of the phase plane. 
This critical point is said to be stable if for each positive number R there 
exists a positive number r < R such that every path which is inside the cir¬ 
cle x 2 + y 2 =r 2 for some t = t 0 remains inside the circle x 2 +y 2 = R 2 for all t > t 0 
(Figure 80). Loosely speaking, a critical point is stable if all paths that get 
sufficiently close to the point stay close to the point. Further, our critical 
point is said to be asymptotically stable if it is stable and there exists a circle 
x 2 + y 2 = r 2 such that every path which is inside this circle for some t-t 0 
approaches the origin as t -* Finally, if our critical point is not stable, then 
it is called unstable. 

As examples of these concepts, we point out that the node in Figure 74, the 
saddle point in Figure 75, and the spiral on the left in Figure 78 are unstable, 
while the center in Figure 76 is stable but not asymptotically stable. The node 
in Figure 73, the spiral in Figure 77, and the spiral on the right in Figure 78 
are asymptotically stable. 
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FIGURE 80 


Problems 


1. For each of the following nonlinear systems: (i) find the critical points; 
(ii) find the differential equation of the paths; (iii) solve this equation to 
find the paths; and (iv) sketch a few of the paths and show the direction 
of increasing t. 


(a) 


(b) 


(c) 


dx 
dt 
dy_ 
„ dt 

dx 
dt 
dy 
„ dt 

dx 

dt 

dy 


_ dt 


y(x 2 + 1) 

2 xy 2 ; 

y(x 2 + 1) 

-x(x 2 + 1); 

e y cosx; 
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(d) 


dx 

dt 

dy_ 

At 


- -X 

- 2x 2 y 2 - 


2 . 


Each of the following linear systems has the origin as an isolated criti¬ 
cal point, (i) Find the general solution, (ii) Find the differential equation 
of the paths, (iii) Solve the equation found in (ii) and sketch a few of the 
paths, showing the direction of increasing t. (iv) Discuss the stability of 
the critical point. 


(a) 


dx 

dt 

dy_ 

dt 


x 

-y ; 


(b) 


dx 

dt 

dy_ 

, dt 


-x 


- 2 y ; 


(C) 


dx 

dt 

dy 


. dt 


4 y 

-X. 


3. Sketch the phase portrait of the equation d 2 x/dt 2 -2x 3 , and show that it 
has an unstable isolated critical point at the origin. 


60 Critical Points and Stability for Linear Systems 

Our goal in this chapter is to learn as much as we can about nonlinear dif¬ 
ferential equations by studying the phase portraits of nonlinear autonomous 
systems of the form 


f -*■*> 

One aspect of this is the problem of classifying the critical points of such a 
system with respect to their nature and stability. It will be seen in Section 62 
that under suitable conditions this problem can be solved for a given non¬ 
linear system by studying a related linear system. We therefore devote this 
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section to a complete analysis of the critical points of linear autonomous 
systems. 

We consider the system 


dx , 

— = fljx + bpj 

at 

^ = a 2 x + b 2 y, 


( 1 ) 


which has the origin (0,0) as an obvious critical point. We assume throughout 
this section that 


«i 

«2 



( 2 ) 


so that (0,0) is the only critical point. It was proved in Section 56 that (1) has a 
nontrivial solution of the form 


J x = Ae mt 
[y = Be mt 


whenever m is a root of the quadratic equation 

m 2 - (rtj + b^)m + ( af 2 ~ «2&i) = 0/ (3) 

which is called the auxiliary equation of the system. Observe that condition (2) 
implies that zero cannot be a root of (3). 

Let m 1 and m 2 be the roots of (3). We shall prove that the nature of the 
critical point (0,0) of the system (1) is determined by the nature of the num¬ 
bers m j and m 2 . It is reasonable to expect that three possibilities will occur, 
according as m l and m 2 are real and distinct, real and equal, or conjugate 
complex. Unfortunately the situation is a little more complicated than this, 
and it is necessary to consider five cases, subdivided as follows. 

Major cases: 

Case A. The roots in, and m 2 are real, distinct, and of the same sign (node). 
Case B. The roots m 1 and m 2 are real, distinct, and of opposite signs (saddle 
point). 

Case C. The roots m, and m 2 are conjugate complex but not pure imaginary 
(spiral). 

Borderline cases: 

Case D. The roots m 1 and m 2 are real and equal (node). 

Case E. The roots m 1 and m 2 are pure imaginary (center). 
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The reason for the distinction between the major cases and the borderline 
cases will become clear in Section 62. For the present it suffices to remark 
that while the borderline cases are of mathematical interest they have lit¬ 
tle significance for applications, because the circumstances defining them 
are unlikely to arise in physical problems. We now turn to the proofs of the 
assertions in parentheses. 


Case A. If the roots m 1 and m 2 are real, distinct, and of the same sign, then 
the critical point (0,0) is a node. 

Proof. We begin by assuming that m l and m 2 are both negative, and we 
choose the notation so that m 1 < m 2 < 0. By Section 56, the general solution of 
(1) in this case is 


jx = c 1 A 1 e mit + c 2 A 2 e m2t 
\y = c 1 B 1 e mt + c 2 B 2 e m2t , 

where the A's and B's are definite constants such that BJA X / B 2 /A 2 , and 
where the c's are arbitrary constants. When c 2 = 0, we obtain the solutions 


jx = CiA 1 e mt 

\y = c 1 B 1 e mit , 

and when c 1 = 0, we obtain the solutions 


(5) 


jx — c 2 A 2 e mit 
[y = c 2 B 2 e"‘ 2t . 

For any c, > 0, the solution (5) represents a path consisting of half of the line 
A x y = B 1 x with slope B 1 /A 1 ; and for any q < 0, it represents a path consist¬ 
ing of the other half of this line (the half on the other side of the origin). 
Since rn ] < 0, both of these half-line paths approach (0,0) as f -»■ °°; and since 
y/x = B x /A v both enter (0,0) with slope 6,/h, (Figure 81). In exactly the same 
way, the solutions (6) represent two half-line paths lying on the line A 2 ij = B 2 x 
with slope B 2 /A 2 . These two paths also approach (0,0) as t -* °°,and enter it 
with slope B 2 /A 2 . 

If q # 0 and c 2 # 0, the general solution (4) represents curved paths. Since 
, < 0 and m 2 < 0, these paths also approach (0,0) as f Furthermore, since 
;«j - m 2 < 0 and 


y _ CiB x e mit + c 2 B 2 e mi ‘ _ (c 1 B 1 /c 2 )e im - m2)t + B 2 
x ~ c 1 A 1 e mt +c 2 A 2 e m2t _ (c 1 A 1 /c 2 )e {m - m2)t + A 2 ' 
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FIGURE 81 


it is clear that y/x -> B 2 /A 2 as t so all of these paths enter (0,0) with slope 
B 2 /A 2 . Figure 81 presents a qualitative picture of the situation. It is evident 
that our critical point is a node, and that it is asymptotically stable. 

If m 1 and m 2 are both positive, and if we choose the notation so that m 1 > 
m 2 > 0, then the situation is exactly the same except that all the paths now 
approach and enter (0,0) as t -+-<= 0 . The picture of the paths given in Figure 81 
is unchanged except that the arrows showing their directions are all reversed. 
We still have a node, but now it is unstable. 

Case B. If the roots m 1 and m 2 are real, distinct, and of opposite signs, then 
the critical point (0,0) is a saddle point. 

Proof. We may choose the notation so that m 1 < 0 < m 2 . The general solu¬ 
tion of (1) can still be written in the form (4), and again we have particular 
solutions of the forms (5) and (6). The two half-line paths represented by (5) 
still approach and enter (0,0) as t -* °°, but this time the two half-line paths 
represented by (6) approach and enter (0,0) as t ->■ If c x # 0 and c 2 / 0, the 
general solution (4) still represents curved paths, but since m l < 0 < m 2 , none 
of these paths approaches (0,0) as t -> °° or t -> Instead, as t °°, each of 
these paths is asymptotic to one of the half-line paths represented by (6); and 
as t -> - 00 , each is asymptotic to one of the half-line paths represented by (5). 
Figure 82 gives a qualitative picture of this behavior. In this case the critical 
point is a saddle point, and it is obviously unstable. 

Case C. If the roots m v and m 2 are conjugate complex but not pure imaginary, 
then the critical point (0,0) is a spiral. 
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FIGURE 82 


Proof. In this case we can write m 1 and m 2 in the form a ± ib where a and b are 
nonzero real numbers. Also, for later use, we observe that the discriminant 
D of equation (3) is negative: 


D-(a 1 + bfj 1 - A(a l b 2 - a 2 b^) 

= (flj - b 2 ) 2 + 4fl 2 bi < 0. (7) 

By Section 56, the general solution of (1) in this case is 


jx = e at [c 1 (A 1 cos bt - A 2 sin bt) + c 2 (Ai sin bt + A 2 cos bt)] 
jy = e^lcfBi cos bt - B 2 sin bt) + c 2 (B 1 sin bt + B 2 cos bt)], 

where the A's and B’s are definite constants and the c's are arbitrary 
constants. 

Let us first assume that a < 0. Then it is clear from formulas (8) that x ^ 0 
and y -> 0 as t <*>, so all the paths approach (0,0) as t -»-°°. We now prove that 
the paths do not enter the point (0,0) as t —>• °°, but instead wind around it in 
a spiral-like manner. To accomplish this we introduce the polar cordinate 0 
and show that, along any path, dQ/dt is either positive for all t or negative for 
all t. We begin with the fact that 0 = tarn 1 (y/x), so 
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dQ xdy / dt - ydx / dt 


dt 


and by using equations (1) we obtain 


dO a 2 x 2 +(b 2 -a 1 )xy-b 1 y 2 
dt x 2 + y 2 


(9) 


Since we are interested only in solutions that represent paths, we assume 
that x 2 + y 2 f 0. Now (7) implies that a 2 and b 1 have opposite signs. We con¬ 
sider the case in which a 2 > 0 and b 1 < 0. When y = 0, (9) yields dQ/dt -a 2 > 0. If 
y # 0, dO/dt cannot be 0; for if it were, then (9) would imply that 


a 2 x 2 +(b 2 - afxy - b t y 2 -0 


or 



( 10 ) 


for some real number x/y —and this cannot be true because the discriminant 
of the quadratic equation (10) is D, which is negative by (7). This shows that 
dQ/dt is always positive when a 2 > 0, and in the same way we see that it is 
always negative when a 2 < 0. Since by (8), x and y change sign infinitely often 
as t all paths must spiral in to the origin (counterclockwise or clockwise 
according as a 2 > 0 or a 2 < 0). The critical point in this case is therefore a spiral, 
and it is asymptotically stable. 

If a > 0, the situation is the same except that the paths approach (0,0) as 
t -* -oo and the critical point is unstable. Figure 78 illustrates the arrange¬ 
ment of the paths when a 2 > 0. 

Case D. If the roots m 1 and m 2 are real and equal, then the critical point (0,0) 
is a node. 

Proof. We begin by assuming that m : = m 2 = m < 0. There are two subcases 
that require separate discussion: (i) a r = b 2 / 0 and n 2 =b t =0; (ii) all other pos¬ 
sibilities leading to a double root of equation (3). 

We first consider the subcase (i), which is the situation described in the 
footnote in Section 56. If a denotes the common value of a x and b 2 , then equa¬ 
tion (3) becomes m 2 - 2am + a 2 ~ 0 and in-a. The system (1) is thus 


dx 


— = ax 
dt 
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and its general solution is 



( 11 ) 


where c 1 and c 2 are arbitrary constants. The paths defined by (11) are half¬ 
lines of all possible slopes (Figure 83), and since m< 0 we see that each path 
approaches and enters (0,0) as f -* The critical point is therefore a node, 
and it is asymptotically stable. If m > 0, we have the same situation except 
that the paths enter (0,0) as t -> -°°, the arrows in Figure 83 are reversed, and 
(0,0) is unstable. 

We now discuss subcase (ii). By formulas 56-(20) and Problem 56-(4), the 
general solution of (1) can be written in the form 


x=c 1 Ae mt +c 2 (A 1 +At)e mt 
y = c 1 Be mt + c 2 (B 1 +Bt)e mt , 


( 12 ) 


where the /Vs and B's are definite constants and the c's are arbitrary con¬ 
stants. When c 2 - 0, we obtain the solutions 


y 



X 


FIGURE 83 
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jx = CiAe mt 
jy = c x Be mt . 


(13) 


We know that these solutions represent two half-line paths lying on the line 
Ay = Bx with slope B/A, and since m < 0 both paths approach (0,0) as t -> °° 
(Figure 84). Also, since y/x-B/A, both paths enter (0,0) with slope B/A. If 
c 2 # 0, the solutions (12) represent curved paths, and since m < 0 it is clear 
from (12) that these paths approach (0,0) as t -> Furthermore, it follows 
from 


y Ci6e mt + c 2 (Bi + Bt)e mt c t B/c 2 + B 1 + Bt 
x CiAe mt + c 2 {Ai + At)e m ‘ CiA/c 2 + A 1 + At 


that y/x B/A as t -> °°, so these curved paths all enter (0,0) with slope B/A. 
We also observe that y/x -> B/A as f —> Figure 84 gives a qualitative pic¬ 
ture of the arrangement of these paths. It is clear that (0,0) is a node that is 
asymptotically stable. If m > 0, the situation is unchanged except that the 
directions of the paths are reversed and the critical point is unstable. 

Case E. If the roots m 1 and m 2 are pure imaginary, then the critical point (0,0) 
is a center. 



FIGURE 84 
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FIGURE 85 

Proof. It suffices here to refer back to the discussion of Case C, for now m 1 
and m 2 are of the form a ± ib with a- 0 and b / 0. The general solution of 
(1) is therefore given by (8) with the exponential factor missing, so x(t) and 
y(t) are periodic and each path is a closed curve surrounding the origin. As 
Figure 85 suggests, these curves are actually ellipses; this can be proved (see 
Problem 5) by solving the differential equation of the paths, 

dy = a 2 x + b 2 y ' (14) 

dx a r x + b x y 

Our critical point (0,0) is evidently a center that is stable but not asymptoti¬ 
cally stable. 

In the above discussions we have made a number of statements about sta¬ 
bility. It will be convenient to summarize this information as follows. 


Theorem A. The critical point (0,0) of the linear system (1) is stable if and only if 
both roots of the auxiliary equation (3) have nonpositive real parts, and it is asymp¬ 
totically stable if and only if both roots have negative real parts. 


If we now write equation (3) in the form 


(m - mf){m - mf) = m 2 + pm + q = 0 , 


( 15 ) 
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FIGURE 86 


so that p = ~(m 1 + m 2 ) and q = m 1 m 2 , then our five cases can be described just as 
readily in terms of the coefficients p and q as in terms of the roots m 1 and m 2 . 
In fact, if we interpret these cases in the pq-plane, then we arrive at a striking 
diagram (Figure 86) that displays at a glance the nature and stability proper¬ 
ties of the critical point (0,0). The first thing to notice is that the p-axis q = 0 is 
excluded, since by condition (2) we know that m l m 2 / 0. In the light of what 
we have learned about our five cases, all of the information contained in the 
diagram follows directly from the fact that 



Thus, above the parabola p 2 - Aq = 0, we have p 2 - Aq < 0, so in , and m 2 are 
conjugate complex numbers that are pure imaginary if and only if p = 0; these 
are Cases C and E comprising the spirals and centers. Below the p-axis we 
have q < 0, which means that m, and m 2 are real, distinct, and have opposite 
signs; this yields the saddle points of Case B. And finally, the zone between 
these two regions (including the parabola but excluding the p-axis) is charac¬ 
terized by the relations p 2 - Aq > 0 and q > 0, so m 1 and m 2 are real and of the 
same sign; here we have the nodes of Cases A and D. Furthermore, it is clear 
that there is precisely one region of asymptotic stability: the first quadrant. 
We state this formally as follows. 

Theorem B. The critical point (0,0) of the linear system (1) is asymptotically stable 
if and only if the coefficients p = ~(a 1 + in) and q = a 1 b 2 - a 2 b 1 of the auxiliary equation 
(3) are both positive. 


Finally, it should be emphasized that we have studied the paths of our linear 
system near a critical point by analyzing explicit solutions of the system. 
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In the next two sections we enter more fully into the spirit of the subject by 
investigating similar problems for nonlinear systems, which in general can¬ 
not be solved explicitly. 


Problems 


1 . 


Determine the nature and stability properties of the critical point (0,0) 
for each of the following linear autonomous systems: 
dx 
dt 


(a) 


= 2x 


(b) 


dy 

dt 

dx 

dt 


3 y; 

-x-2y 


(c) 

(d) 


dy 
. dt 
dx 

It 
dy_ 
. dt 
dx 
dt 


4x-5y; 
-3x + Ay 
-2x + 3y; 
5x + 2y 


(e) 


dy 
. dt 
dx 
dt 


-17x-5y; 


-Ax-y 


(f) 


dy 

dt 

dx 

dt 

dy 

dt 


x-2 y; 
4x-3y 
8x-6y; 


2 . 



4x-2y 


— = 5x + 2y. 

[dt 

If a 1 b 2 - a 2 b l = 0, show that the system (1) has infinitely many critical 
points, none of which are isolated. 
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3. (a) If a t b 2 - a 2 b 1 / 0, show that the system 

dx 

— = flix + bpy + Ci 
, dy 

— = a 2 x + b 2 y + c 2 
. dt 

has a single isolated critical point (x 0 ,y 0 ). 

(b) Show that the system in (a) can be written in the form of (1) by 
means of the change of variables x = x - x 0 and y = y - y 0 . 

(c) Find the critical point of the system 

dX -~2x-2y + U) 
dt 

< 

— = llx-8y+ 49, 

. dt 


write the system in the form of (1) by changing the variables, and deter¬ 
mine the nature and stability properties of the critical point. 

4. In Section 20 we studied the free vibrations of a mass attached to a 
spring by solving the equation 


d 2 x 

dt 2 


+ 2 b 


dx 

dt 


+ a 2 x = 0, 


where b > 0 and a > 0 are constants representing the viscosity of the 
medium and the stiffness of the spring, respectively. Consider the 
equivalent autonomous system 



— = -a 2 x - 2 by, 
dt J 


(*) 


which has (0,0) as its only critical point. 

(a) Find the auxiliary equation of (*). What are p and q? 

(b) For each of the following four cases, describe the nature and stabil¬ 
ity properties of the critical point, and give a brief physical interpre¬ 
tation of the corresponding motion of the mass: 

(i) b = 0; 

(ii) 0 <b < a; 

(iii) b-a; 

(iv) b > a. 
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5. Solve equation (14) under the hypotheses of Case E, and show that the 
result is a one-parameter family of ellipses surrounding the origin. 
Hint: Recall that if Ax 2 + Bxy + Cy 2 -D is the equation of a real curve, 
then the curve is an ellipse if and only if the discriminant B 2 - 4 AC is 
negative. 


61 Stability By Liapunov's Direct Method 

It is intuitively clear that if the total energy of a physical system has a local 
minimum at a certain equilibrium point, then that point is stable. This idea 
was generalized by Liapunov 6 7 into a simple but powerful method for study¬ 
ing stability problems in a broader context. We shall discuss Liapunov's 
method and some of its applications in this and the next section. 

Consider an autonomous system 


d vr = F (x,y) 

at 

^ = G(x,y), 


( 1 ) 


and assume that this system has an isolated critical point, which as usual 
we take to be the origin (0,0)7 Let C = [x(f),y(f)] be a path of (1), and consider a 
function E(x,y) that is continuous and has continuous first partial derivatives 
in a region containing this path. If a point (x,y) moves along the path in accor¬ 
dance with the equations x = x(f) and y = y(t), then E(x,y) can be regarded as a 
function of t along C [we denote this function by E(t)] and its rate of change is 


dE dE dx dE dy 
dt dx dt dy dt 


dE _ dE 

— F + — 
dx dy 


G. 


( 2 ) 


6 Alexander Mikhailovich Liapunov (1857-1918) was a Russian mathematician and mechani¬ 
cal engineer. He had the very rare merit of producing a doctoral dissertation of lasting value. 
This classic work was originally published in 1892 in Russian, but is now available in an 
English translation. Stability of Motion, Academic Press, New York, 1966. Liapunov died by 
violence in Odessa, which cannot be considered a surprising fate for a middle-class intel¬ 
lectual in the chaotic aftermath of the Russian Revolution. 

7 A critical point (x 0 ,y 0 ) can always be moved to the origin by a simple translation of coordi¬ 
nates x =x-x 0 and y = y-y 0 , so there is no loss of generality in assuming that it lies at the 
origin in the first place. 
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This formula is at the heart of Liapunov's ideas, and in order to exploit it 
we need several definitions that specify the kinds of functions we shall be 
interested in. 

Suppose that E{x,y) is continuous and has continuous first partial deriva¬ 
tives in some region containing the origin. If E vanishes at the origin, so 
that £(0,0) = 0, then it is said to be positive definite if E(x,y) > 0 for ( x,y) / (0,0), 
and negative definite if E(x,y) < 0 for ( x,y) / (0,0). Similarly, £ is called positive 
semidefinite if E(0,0) = 0 and E(x,y) > 0 for ( x,y) / (0,0), and negative semidefinite 
if E(0,0) = 0 and E(x,y) < 0 for (x,y) / (0,0). It is clear that functions of the form 
ax 2m + by 2n , where a and b are positive constants and in and n are positive inte¬ 
gers, are positive definite. Since E(x,y) is negative definite if and only if -E(x,y) 
is positive definite, functions of the form ax 2 '" + by 2 " with a < 0 and b < 0 are 
negative definite. The functions x 2m , y 2m , and (x - if) 2 '" are not positive defi¬ 
nite, but are nevertheless positive semidefinite. If E(x,y) is positive definite, 
then z = E(x,y) can be interpreted as the equation of a surface (Figure 87) that 
resembles a paraboloid opening upward and tangent to the xy-plane at the 
origin. 

A positive definite function E(x,y) with the property that 


dE 

dx 


F + 



(3) 



FIGURE 87 
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is negative semidefinite is called a Liapunov function for the system (1). By 
formula (2), the requirement that (3) be negative semidefinite means that 
dE/dt < 0—and therefore £ is nonincreasing—along the paths of (1) near the 
origin. These functions generalize the concept of the total energy of a physi¬ 
cal system. Their relevance for stability problems is made clear in the follow¬ 
ing theorem, which is Liapunov's basic discovery. 

Theorem A. If there exists a Liapunov function E(x,y) for the system (1), then the 
critical point (0,0) is stable. Furthermore, if this function has the additional property 
that the function (3) is negative definite, then the critical point (0,0) is asymptotically 
stable. 

Proof. Let C, be a circle of radius R > 0 centered on the origin (Figure 88), and 
assume also that C, is small enough to lie entirely in the domain of definition 
of the function E. Since E(x,y) is continuous and positive definite, it has a pos¬ 
itive minimum m on C,. Next, E(x,y) is continuous at the origin and vanishes 
there, so we can find a positive number r < R such that E(x,y) < m whenever 
(x,y) is inside the circle C 2 of radius r. Now let C = [x(t), y{i)\ be any path which 
is inside C 2 for t = t 0 . Then E(t 0 ) < m, and since (3) is negative semidefinite we 
have dE/dt < 0, which implies that E(t) < E(t 0 ) < m for all t > t 0 . It follows that 
the path C can never reach the circle C, for any t > t 0 , so we have stability. 

To prove the second part of the theorem, it suffices to show that under 
the additional hypothesis we also have E(t) -> 0, for since E(x,y) is positive 
definite this will imply that the path C approaches the critical point (0,0). 



FIGURE 88 
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We begin by observing that since dE/dt < 0, it follows that E(t) is a decreasing 
function; and since by hypothesis E(t) is bounded below by 0, we conclude 
that E(t) approaches some limit L > 0 as t -* To prove that E(t) -* 0 it suf¬ 
fices to show that L = 0, so we assume that L > 0 and deduce a contradiction. 
Choose a positive number r <r with the property that E(x,y) < L whenever 
( x,y ) is inside the circle C 3 with radius r. Since the function (3) is continuous 
and negative definite, it has a negative maximum -k in the ring consisting 
of the circles Q and C 3 and the region between them. This ring contains the 
entire path C for t > t 0 , so the equation 



yields the inequality 


£(0 £ E(t 0 ) - k(t - t 0 ) 


(4) 


for all t > f 0 . However, the right side of (4) becomes negatively infinite as 
t -*■ °°, so E(t) -> as t -> This contradicts the fact that E(x,y) > 0, so we 
conclude that L = 0 and the proof is complete. 

Example 1. Consider the equation of motion of a mass m attached to a 
spring: 



dC i it 


(5) 


Here c > 0 is a constant representing the viscosity of the medium through 
which the mass moves, and k > 0 is the spring constant. The autonomous 
system equivalent to (5) is 


dx 


_ dt 



— x- y, 

m m 


( 6 ) 


and its only critical point is (0,0). The kinetic energy of the mass is my 2 /2, 
and the potential energy (or the energy stored in the spring) is 
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Thus the total energy of the system is 

E {x,y) = ^my 2 + hcx 2 - (7) 

It is easy to see that (7) is positive definite; and since 

8E 8E , ( k c \ 

— F H-G = kxy + my\ - x -y 

dx 8y v m m J 

= - cy 2 < 0 , 

(7) is a Liapunov function for (6) and the critical point (0,0) is stable. We 
know from Problem 60-4 that when c > 0 this critical point is asymptoti¬ 
cally stable, but the particular Liapunov function discussed here is not 
capable of detecting this fact. 8 


Example 2. The system 


dx 

dt 

dy_ 

dt 



( 8 ) 


has (0,0) as an isolated critical point. Let us try to prove stability by con¬ 
structing a Liapunov function of the form E(x,y)=ax 2m + by 2n . It is clear 
that 


— F +—G = 2max lm -\-2xy) + 2nby ln ~\x 2 - y 3 ) 
dx dy 

= (-4 max 2m y + 2 nbx 2 y ln ~ 1 )- 2nby ln+1 . 

We wish to make the expression in parentheses vanish, and inspection 
shows that this can be done by choosing m = l,n = l,a= 1, and b = 2. With 
these choices we have E{x,y)=x 2 +2y 2 (which is positive definite) and 
(dE/dx)F + (dE/dy)G = -4y 4 (which is negative semidefinite). The critical 
point (0,0) of the system (8) is therefore stable. 


It is clear from this example that in complicated situations it may be very dif¬ 
ficult indeed to construct suitable Liapunov functions. The following result 
is sometimes helpful in this connection. 


It is known that both stability and asymptotic stability can always be detected by suitable 
Liapunov functions, but knowing in principle that such a function exists is a very differ¬ 
ent matter from actually finding one. For references on this point, see L. Cesari, Asymptotic 
Behavior and Stability Problems in Ordinary Differential Equations, p. Ill, Academic Press, 
New York, 1963; or G. Sansone and R. Conti, Non-Linear Differential Equations, p. 481, Macmillan, 
New York, 1964. 
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Theorem B The function E(x,y) = ax 2 + bxy + cy 2 is positive definite if and only if 
a > 0 and b 2 - Aac < 0, and is negative definite if and only if a < 0 and b 2 - 4 ac < 0. 

Proof. If y = 0, we have E(x,0)-ax 2 , so E(x,0) > 0 for x / 0 if and only if a > 0. 
If y ^ 0, we have 


E(x,y) = y 2 



f 

2 

f \ 



X 

+ b 

X 


a 

— 

— 

+ c 


UJ 


1 y) 



and when a > 0 the bracketed polynomial in x/y (which is positive for large 
x/y) is positive for all x/y if and only if b 2 - 4ae < 0. This proves the first part 
of the theorem, and the second part follows at once by considering the func¬ 
tion -E(x,y). 


Problems 


1. Determine whether each of the following functions is positive definite, 
negative definite, or neither: 

(a) x 2 - xy - y 2 ; 

(b) 2x 2 - 3xi/ + 3y 2 ; 

(c) -2x 2 + 3xy - y 2 ; 

(d) -x 2 - 4xy - 5y 2 . 

2. Show that a function of the form ax 3 + bx 2 y + cxy 2 + dy 3 cannot be either 
positive definite or negative definite. 

3. Show that (0,0) is an asymptotically stable critical point for each of the 
following systems: 

dx „ 3 
= -3x - y 


(a) 


(b) 


dt 


dx 

dt 

dy 

dt 


= -2 x + xy 3 


2 2 

=-* y ■ 


4. Prove that the critical point (0,0) of the system (1) is unstable if there 
exists a function E(x,y) with the following properties: 

(a) £(x,y) is continuous and has continuous first partial derivatives in 
some region containing the origin; 
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(b) £(0,0) = 0; 

(c) every circle centered on (0,0) contains at least one point where E(x,y) 
is positive; 

(d) ( dE/dx)F + {dE/dy)G is positive definite. 

5. Show that (0,0) is an unstable critical point for the system 


dx 

It 


2xy + x 3 



+y 5 - 


6. Assume that f(x) is a function such that/(0) = 0 and xf(x) > 0 for x / 0 
[that is,/(x) > 0 when x > 0 and f(x) < 0 when x < 0]. 

(a) Show that 


E(x,y) 


|y 2 +j/(x)dx 
o 


is positive definite. 

(b) Show that the equation 


d 2 x 

dt 2 


+ /(x) = 0 


has x-0,y = dx/dt = 0 a s a stable critical point. 

(c) If g(x) > 0 in some neighborhood of the origin, show that the equation 

d 2 x dx 

^4 + g(x) — + /(x) = 0 

dt 2 ' dt J 

has x = 0,y = dx/df= 0 as a stable critical point. 


62 Simple Critical Points of Nonlinear Systems 

Consider an autonomous system 


dx 

dt 


= F(x,y) 


% = G(x,y) 
at 


(i) 
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with an isolated critical point at (0,0). If F(x,y) and G(x,y ) can be expanded in 
power series in x and y, then (1) takes the form 

dx 

— = ape + bpy + cpf + dpxy + eqy 2 + ■■■ 

' d , , <2) 

— = a 2 x + b 2 y + c 2 x 2 + d 2 xy + e 2 y 2 + ■■■. 

. dt 

When |x| and \y\ are small—that is, when {x,y) is close to the origin—the 
terms of second degree and higher are very small. It is therefore natural to 
discard these nonlinear terms and conjecture that the qualitative behavior of 
the paths of (2) near the critical point (0,0) is similar to that of the paths of the 
related linear system 


dx 

— = a\X + bry 
dt 

d ^ = a 2 x + b 2 y. 


(3) 


We shall see that in general this is actually the case. The process of replacing 
(2) by the linear system (3) is usually called linearization. 

More generally, we shall consider systems of the form 

dx 

— = a l x + b 1 y + f(x,y) 

' I « 

-by = a 2 x + b 2 y + g{x,y). 

I dt 


It will be assumed that 


so that the related linear system (3) has (0,0) as an isolated critical point; that 
f(x,y) and g(x,y) are continuous and have continuous first partial derivatives 
for all (x,y); and that as ( x,y) -* (0,0) we have 

lim !All =Q and lim =0. (6) 

Observe that conditions (6) imply that/(0,0) = 0 and y(0,0) = 0, so (0,0) is a criti¬ 
cal point of (4); also, it is not difficult to prove that this critical point is isolated 
(see Problem 1). With the restrictions listed above, (0,0) is said to be a simple 
critical point of the system (4). 
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Example 1. In the case of the system 


= -7.x + 3 y + xy 
s= -x + y - Ixy 1 


we have 


fll 

h 


-2 

3 


b 2 


-1 

1 


(7) 


so (5) is satisfied. Furthermore, by using polar coordinates we see that 


\f(x,y)\ |r 2 sin0 cos 9| 

V* +f ~ r 


and 


\g( x >y)\ 

^7 


|2r 3 sin 2 Ocos0| 

1 - L < 2 r 2 , 

r 


so f(x,y)/r and g(x,y)/r -*• 0 as (x,y) -> (0,0) (or as r -> 0). This shows that 
conditions (6) are also satisfied, so (0,0) is a simple critical point of the 
system (7). 


The main facts about the nature of simple critical points are given in the fol¬ 
lowing theorem of Poincare, which we state without proof. 9 


Theorem A. Let (0,0) be a simple critical point of the nonlinear system (4), and 
consider the related linear system (3). If the critical point (0,0) of (3) falls under any 
one of the three major cases described in Section 60, then the critical point (0,0) of (4) 
is of the same type. 


9 Detailed treatments can be found in W. Hurewicz, Lectures on Ordinary Differential Equations, 
pp. 86-98, MIT, Cambridge, Mass., 1958; L. Cesari, Asymptotic Behavior and Stability Problems in 
Ordinary Differential Equations, pp. 157-163, Academic Press, New York, 1963; or F. G. Tricomi, 
Differential Equations, pp. 53-72, Blackie, Glasgow, 1961. 
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As an illustration, we examine the nonlinear system (7) of Example 1, whose 
related linear system is 


dx 
dt 
dy_ 
„ dt 


-2x + 3 y 


-x + ij. 


( 8 ) 


The auxiliary equation of (8) is m 2 +m+ 1 = 0, with roots 

—1 + V3 i 
nh,m 2 = ---. 

Since these roots are conjugate complex but not pure imaginary, we have 
Case C and the critical point (0,0) of the linear system (8) is a spiral. By 
Theorem A, the critical point (0,0) of the nonlinear system (7) is also a spiral. 

It should be understood that while the type of the critical point (0,0) is the 
same for (4) as it is for (3) in the cases covered by the theorem, the actual 
appearance of the paths may be somewhat different. For example. Figure 82 
shows a typical saddle point for a linear system, whereas Figure 89 suggests 
how a nonlinear saddle point might look. A certain amount of distortion is 
clearly present in the latter, but nevertheless the qualitative features of the 
two configurations are the same. 

It is natural to wonder about the two borderline cases, which are not men¬ 
tioned in Theorem A. The facts are these: if the related linear system (3) has a 
borderline node at the origin (Case D), then the nonlinear system (4) can have 



FIGURE 89 
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either a node or a spiral; and if (3) has a center at the origin (Case E), then (4) 
can have either a center or a spiral. For example, (0,0) is a critical point for 
each of the nonlinear systems 


dx 
dt 
dy 
. dt 


= -y-x 


= X 


and 


dx o 

— = -y-x 
dt J 



In each case the related 


linear system is 


dx 

dt 


= -y 


dy 

— = x. 
dt 


(9) 


( 10 ) 


It is easy to see that (0,0) is a center for (10). However, it can be shown that 
while (0,0) is a center for the first system of (9), it is a spiral for the second. 10 

We have already encountered a considerable variety of configurations at 
critical points of linear systems, and the above remarks show that no new 
phenomena appear at simple critical points of nonlinear systems. What about 
critical points that are not simple? The possibilities here can best be appreci¬ 
ated by examining a nonlinear system of the form (2). If the linear terms in 
(2) do not determine the pattern of the paths near the origin, then we must 
consider the second degree terms; if these fail to determine the pattern, then 
the third degree terms must be taken into account, and so on. This suggests 
that in addition to the linear configurations, a great many others can arise, of 
infinite variety and staggering complexity. Several are shown in Figure 90. 
It is perhaps surprising to realize that such involved patterns as these can 
occur in connection with systems of rather simple appearance. For example, 
the three figures in the upper row show the arrangement of the paths of 


dx 

17 = 2xy 
at 

dx , _ , 

— = x 3 - 2xxr 

dt 

"Vs 

1 

II 



- = x 


-4yJ\xy\ 


dx 
dt 

dy , „ 
— = -y + 4xJ\xy 
dt J Vl J 


In the first case, this can be seen at once by looking at Figure 3 and 
equation 3-(8). 

We now discuss the question of stability for a simple critical point. The 
main result here is due to Liapunov: if (3) is asymptotically stable at the ori¬ 
gin, then (4) is also. We state this formally as follows. 


10 See Hurewicz, op. cit., p. 99. 
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FIGURE 90 


Theorem B. Let (0,0) be a simple critical point of the nonlinear system (4), and 
consider the related linear system (3). If the critical point (0,0) of (3) is asymptotically 
stable, then the critical point (0,0) o/(4) is also asymptotically stable. 

Proof. By Theorem 61-A, it suffices to construct a suitable Liapunov function 
for the system (4), and this is what we do. 

Theorem 60-B tells us that the coefficients of the linear system (3) satisfy 
the conditions 


p = -(flj + b 2 )> 0 and q = a r b 2 - a 2 b 1 > 0. 


( 11 ) 


Now define 


by putting 


and 


E(x, y) = — (ax 2 + 2bxy + cy 2 ) 


a = 


a 2 + b 2 + (af} 2 — a 2 bf) 
D ' 


a]a 2 + b 2 b 2 
D 


a 2 + b 2 +(af> 2 -a 2 b 2 ) 


D 
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where 


D=pq = -(a x + b 2 )(a 1 b 2 - a 2 b i). 

By (11), we see that D > 0 and a > 0. Also, an easy calculation shows that 

D 2 (ac-b 2 ) = (a 2 + b 2 )(a\ + b 2 ) 

+ (ai + bi + a 2 + b 2 )(a 1 b 2 - a 2 b 
+ (a 1 b 2 - a 2 b\) 2 - (a x a 2 + b x b 2 ) 2 


= («2 + bl + al + b^)(a x b 2 - a 2 b x ) 

+ l(a x b 2 — a 2 b x y 

> 0 , 

so b 2 - ac < 0. Thus, by Theorem 61-B, we know that the function E(x,ij) is 
positive definite. Furthermore, another calculation (whose details we leave 
to the reader) yields 

f^(«i* + h y) + + b 2 y) = -(x 2 + y 2 ). (12) 

ox oy 

This function is clearly negative definite, so E(x,y) is a Liapunov function for 
the linear system (3)! 1 

We next prove that E(x,y) is also a Liapunov function for the nonlinear 
system (4). If F and G are defined by 

F(x,y) = a : x + b : y+f(x,y) 


and 


G{x,y) = a 2 x + b 2 y+g(x,y), 

then since E is known to be positive definite, it suffices to show that 


dE 

dx 


F + 



(13) 


11 The reason for the definitions of a, b, and c can now be understood: we want (12) to be true. 
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is negative definite. If we use (12), then (13) becomes 

-(x 2 + y 2 ) + (ax + hy)f(x,y) + (bx + cy)g(x,y); 

and by introducing polar coordinates we can write this as 

-r 2 + r[(a cos 0 + b sin 0)/(x,y) + (b cos 0 + c sin Q)g(x,y)\ 

Denote the largest of the numbers \a\, \b\, |c| by K. Our assumption (6) now 
implies that 




and 


ls(*/y)l< 


r 

6 K 


for all sufficiently small r > 0, so 


8E r dE„ 2 410 

— F + — G<-r + - 

dx dy 6K 



for these r's. Thus E(x,y) is a positive definite function with the property that 
(13) is negative definite. Theorem 61-A now implies that (0,0) is an asymptoti¬ 
cally stable critical point of (4), and the proof is complete. 


To illustrate this theorem, we again consider the nonlinear system (7) of 
Example 1, whose related linear system is (8). For (8) we have p = 1 > 0 and 
q = 1 > 0, so the critical point (0,0) is asymptotically stable, both for the linear 
system (8) and for the nonlinear system (7). 


Example 2. We know from Section 58 that the equation of motion for the 
damped vibrations of a pendulum is 


d 2 x c dx 

0 H---h 

dt 2 m dt 


O’ 

—sinx = 0, 
a 


where c is a positive constant. The equivalent nonlinear system is 

dx 

— = y 

dt (14) 


dy g . c 

— = --smi- y. 

^dt a m 
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Let us now write (14) in the form 


dx 

dt 


= y 


dy S c S / . \ 

—— — ——x - y + — (x-smx). 

dt a m a 


(15) 


It is easy to see that 


x - sin x 

i — —>0 

V* 2 + y 2 


as (x,y) (0,0), for if x ^ 0, we have 


|x-sinx| |x-sinx| 

V + y 2 " M 

and since (0,0) is evidently an isolated critical point of the related linear 
system 


l-^UO; 

x 


dx 

, dt (16) 

dy g c 

— = -—x - y, 

. dt a m 

it follows that (0,0) is a simple critical point of (15). Inspection shows 
(p=c/m > 0 and q=g/a > 0) that (0,0) is an asymptotically stable critical 
point of (16), so by Theorem B it is also an asymptotically stable critical 
point of (15). This reflects the obvious physical fact that if the pendulum 
is slightly disturbed, then the resulting motion will die out with the pas¬ 
sage of time. 


Problems 

1. Prove that if (0,0) is a simple critical point of (4), then it is necessar¬ 
ily isolated. Hint: Write conditions (6) in the form f(x,\j)/r = e x -> 0 and 
g(x,y)/r -e 2 ^ 0, and in the light of (5) use polar coordinates to deduce a 
contradiction from the assumption that the right sides of (4) both vanish 
at points arbitrarily close to the origin but different from it. 
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2. Sketch the family of curves whose polar equation is r=a sin 20 (see 
Figure 90), and express the differential equation of this family in the 
form dy/dx = G(x,y)/E(x,y). 

3. If (0,0) is a simple critical point of (4) and q=a 1 b 2 - a 2 b } < 0, then 
Theorem A implies that (0,0) is a saddle point of (4) and is therefore 
unstable. Prove that if p = -(a 1 + fr 2 ) < 0 and q = a 1 b 2 - a 2 b l > 0, then (0,0) is 
an unstable critical point of (4). Hint: Adapt the proof of Theorem B to 
show that there exists a positive definite function E(x,y) such that 

—(fl 1 x + Diy) +— (a 2 x + b 2 y) = x +y , 
dx dy 


and apply Problem 61-4. (Observe that these facts together with Theorem 
B demonstrate that all the information in Figure 86 about asymptotic 
stability and instability carries over directly to nonlinear systems with 
simple critical points from their related linear systems.) 

4. Show that (0,0) is an asymptotically stable critical point of 


dx 

It 

dy_ 
. dt 


= -y-x 


3 


~ x-y 3 , 


but is an unstable critical point of 


dx 
dt 
dy 
_ dt 


= -y + x 3 
= x + y 3 . 


Flow are these facts related to the parenthetical remark in Problem 3? 

5. Verify that (0,0) is a simple critical point for each of the following sys¬ 
tems, and determine its nature and stability properties: 

, . dx „ 

(a) — = x + y - 2 xy 

dt 

= -2x + y + 3y 2 ; 


(b) 


dx 

dt 

dt 


- = -x- 


y-3x y 
= -2x-4y + i/sinx. 






Nonlinear Equations 


557 


6. The van der Pol equation 

d 2 x , 2 . s dx n 
+ -1) — 7 + x = 0 

Clt Clt 

is equivalent to the system 

dx 



^ = -x-q(x 2 -l )y. 

„ dr 

Investigate the stability properties of the critical point (0,0) for the cases 
q > 0 and p < 0. 


63 Nonlinear Mechanics. Conservative Systems 

It is well known that energy is dissipated in the action of any real dynamical 
system, usually through some form of friction. However, in certain situa¬ 
tions this dissipation is so slow that it can be neglected over relatively short 
periods of time. In such cases we assume the law of conservation of energy, 
namely, that the sum of the kinetic energy and the potential energy is con¬ 
stant. A system of this kind is said to be conservative. Thus the rotating earth 
can be considered a conservative system over short intervals of time involv¬ 
ing only a few centuries, but if we want to study its behavior throughout 
millions of years we must take into account the dissipation of energy by tidal 
friction. 

The simplest conservative system consists of a mass m attached to a spring 
and moving in a straight line through a vacuum. If x denotes the displace¬ 
ment of m from its equilibrium position, and the restoring force exerted on m 
by the spring is -kx where k> 0, then we know that the equation of motion is 


dt 2 

A spring of this kind is called a linear spring because the restoring force is a 
linear function of x. If m moves through a resisting medium, and the resis¬ 
tance (or damping force) exerted on m is -c(dx/dt) where c > 0, then the equa¬ 
tion of motion of this nonconservative system is 
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Here we have linear damping because the damping force is a linear function 
of dx/dt. By analogy, if/and g are arbitrary functions with the property that 
/(0) = 0 and y(0) = 0, then the more general equation 

d 2 x (dx'] „ 


can be interpreted as the equation of motion of a mass m under the action of 
a restoring force -f(x) and a damping force -g(dx/dt). In general these forces are 
nonlinear, and equation (1) can be regarded as the basic equation of nonlin¬ 
ear mechanics. In this section we shall briefly consider the special case of a 
nonlinear conservative system described by the equation 

+ /(*) = 0, (2) 


in which the damping force is zero and there is consequently no dissipation 
of energy. 12 

Equation (2) is equivalent to the autonomous system 


dx 



dy = fix) 
. dt m 


(3) 


If we eliminate dt, we obtain the differential equation of the paths of (3) in 
the phase plane. 


dy /(*) 

dx my 

and this can be written in the form 

my dy = -f(x) dx. 

If x = x 0 and y = y 0 when t = t 0 , then integrating (5) from t 0 to t yields 


X 



*0 


(4) 

(5) 


12 Extensive discussions of (1), with applications to a variety of physical problems, can be 
found in J. J. Stoker, Nonlinear Vibrations, Interscience-Wiley, New York, 1950; and in A. A. 
Andronow and C. E. Chaikin, Theory of Oscillations, Princeton University Press, Princeton, 
N.J., 1949. 
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or 


1 2 

—my + 


^f{x)dx = ^myl + 


*0 

J* f(x)dx. 


( 6 ) 


To interpret this result, we observe that 
energy of the dynamical system and 


1 1 

— my 2 = —m(dx/dt) 2 is the kinetic 


X 

V(x) = jf(x)dx (7) 

o 

is its potential energy Equation (6) therefore expresses the law of conserva¬ 
tion of energy. 


| my 2 + V(x) = E , (8) 

where £ = \my\ + V(x 0 ) is the constant total energy of the system. It is clear 
that (8) is the equation of the paths of (3), since we obtained it by solving (4). 
The particular path determined by specifying a value of £ is a curve of con¬ 
stant energy in the phase plane. The critical points of the system (3) are the 
points (x c ,0) where the x c are the roots of the equation/(x) = 0. As we pointed 
out in Section 58, these are the equilibrium points of the dynamical system 
described by (2). It is evident from (4) that the paths cross the x-axis at right 
angles and are horizontal when they cross the lines x=x c . Equation (8) also 
shows that the paths are symmetric with respect to the x-axis. 

If we write (8) in the form 


y = ±. 


-[E-V(x)] t 

m 


(9) 


then the paths can be constructed by the following easy steps. First, estab¬ 
lish an xz-plane with the z-axis on the same vertical line as the y-axis of the 
phase plane (Figure 91). Next, draw the graph of z = V(x) and several hori¬ 
zontal lines z = £ in the xz-plane (one such line is shown in the figure), and 
observe the geometric meaning of the difference £ - V(x). Finally, for each x, 
multiply £ - V(x) as obtained in the preceding step by 1/m and use formula 
(9) to plot the corresponding values of y in the phase plane directly below. 
Note that since dx/dt-y, the positive direction along any path is to the right 
above the x-axis and to the left below this axis. 
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FIGURE 91 


Example 1. We saw in Section 58 that the equation of motion of an 
undamped pendulum is 


d 2 x 
dt 1 


+ Asinx = 0, 


( 10 ) 


where A: is a positive constant. Since this equation is of the form (2), it can 
be interpreted as describing the undamped rectilinear motion of a unit 
mass under the influence of a nonlinear spring whose restoring force is 
-k sin x. The autonomous system equivalent to (10) is 
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— = -ksinx, 
_ dt 


( 11 ) 


and its critical points are (0,0), (±Jt,0), (±2jt,0), .... The differential equa¬ 
tion of the paths is 


dy _ k sinx 
dx y 

and by separating variables and integrating, we see that the equation of 
the family of paths is 


1 

— y 2 +(k-kcosx) = E. 

This is evidently of the form (8), where m = 1 and 

X 

V{x) = J/(x)dx = k -kcosx 
o 

is the potential energy. We now construct the paths by first drawing 
the graph of z = V(x) and several lines z = E in the xz-plane (Figure 92, 
where z = E = 2k is the only line shown). From this we read off the' val¬ 
ues E - V(x) and sketch the paths in the phase plane directly below by 
using y = ±yj2[E-V(x)]. It is clear from this phase portrait that if the 
total energy E is between 0 and 2k, then the corresponding paths are 
closed and equation (10) has periodic solutions. On the other hand, if 
E > 2k, then the path is not closed and the corresponding solution of 
(10) is not periodic. The value E = 2k separates the two types of motion, 
and for this reason a path corresponding to E = 2k is called a separa- 
trix. The wavy paths outside the separatrices correspond to whirling 
motions of the pendulum, and the closed paths inside to oscillatory 
motions. It is evident that the critical points are alternately unstable 
saddle points and stable but not asymptotically stable centers. For the 
sake of contrast, it is interesting to consider the effect of transforming 
this conservative dynamical system into a nonconservative system by 
introducing a linear damping force. The equation of motion then takes 
the form 


d 2 x dx , . 

—z- + c-hfcsinx = 0, c>0, 

dt 2 dt 

and the configuration of the paths is suggested in Figure 93. We find that 
the centers in Figure 92 become asymptotically stable spirals, and also 
that every path—except the separatrices entering the saddle points as 
t -> °° ultimately winds into one of these spirals. 
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FIGURE 92 



FIGURE 93 
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Problems 


1. If/(0) = 0 and x/(x) > 0 for x # 0, show that the paths of 


d 2 x 
dt 2 


+ /(x) = 0 


are closed curves surrounding the origin in the phase plane; that is, 
show that the critical point x = 0, y = dx/dt = 0 is a stable but not asymp¬ 
totically stable center. Describe this critical point with respect to its 
nature and stability if/(0) = 0 and x/(x) < 0 for x / 0. 

2. Most actual springs are not linear. A nonlinear spring is called hard or soft 
according as the magnitude of the restoring force increases more rapidly 
or less rapidly than a linear function of the displacement. The equation 


d 2 x 

dt 2 


+ kx + ax 3 = 0, 


k > 0 , 


describes the motion of a hard spring if a > 0 and a soft spring if a < 0. 
Sketch the paths in each case. 

3. Find the equation of the paths of 

d 2 x _ o „ 

—p- - x + 2x = 0, 

and sketch these paths in the phase plane. Locate the critical points and 
determine the nature of each. 

4. Since by equation (7) we have dV/dx -fix), the critical points of (3) are 
the points on the x-axis in the phase plane at which V'(x)=0. In terms 
of the curve z = V(x )—if this curve is smooth and well behaved—there 
are three possibilities: maxima, minima, and points of inflection. Sketch 
all three possibilities, and determine the type of critical point associated 
with each (a critical point of the third type is called a cusp). 


64 Periodic Solutions. The Poincare-Bendixson Theorem 

Consider a nonlinear autonomous system 


dx 

dt 


= F(x,y) 




( 1 ) 
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in which the functions F(x,y) and G(x,y ) are continuous and have continuous 
first partial derivatives throughout the phase plane. Our work so far has told 
us practically nothing about the paths of (1) except in the neighborhood of 
certain types of critical points. However, in many problems we are much 
more interested in the global properties of paths than we are in these local 
properties. Global properties of paths are those that describe their behavior 
over large regions of the phase plane, and in general they are very difficult 
to establish. 

The central problem of the global theory is that of determining whether 
(1) has closed paths. As we remarked in Section 58, this problem is 
important because of its close connection with the issue of whether (1) has 
periodic solutions. A solution x(t) and y(t) of (1) is said to be periodic if nei¬ 
ther function is constant, if both are defined for all f, and if there exists a 
number T > 0 such that x(t + T) = x(t) and y(t + T) = y(t) for all t. The smallest 
T with this property is called the period of the solution. 13 It is evident that 
each periodic solution of (1) defines a closed path that is traversed once 
as f increases from t 0 to t 0 +T for any t 0 . Conversely, it is easy to see that 
if C = [x{t),y{t)\ is a closed path of (1), then x(t), y(t) is a periodic solution. 
Accordingly, the search for periodic solutions of (1) reduces to a search for 
closed paths. 

We know from Section 60 that a linear system has closed paths if and 
only if the roots of the auxiliary equation are pure imaginary, and in this 
case every path is closed. Thus, for a linear system, either every path is 
closed or else no path is closed. On the other hand, a nonlinear system can 
perfectly well have a closed path that is isolated, in the sense that no other 
closed paths are near to it. The following is a well-known example of such 
a system: 


^ = -y + X (l-x 2 -y 2 ) 

~77 = x+ y(i -x 2 - y 2 )- 

l at 


( 2 ) 


To solve this system we introduce polar coordinates r and 0, where 
x-r cos 0 and y-r sin 0. If we differentiate the relations x 2 + y 2 - r 2 and 0 = tan* 1 
(y/x), we obtain the useful formulas 



(3) 


13 Every periodic solution has a period in this sense. Why? 
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On multiplying the first equation of (2) by x and the second by y, and adding, 
we find that 


r^ = r 2 (l-r 2 ). (4) 

at 

Similarly, if we multiply the second by x and the first by y, and subtract, we 
get 


2 dQ 
r — 
dt 


= r 


(5) 


The system (2) has a single critical point at r = 0. Since we are concerned only 
with finding the paths, we may assume that r > 0. In this case, (4) and (5) 
show that (2) becomes 


dr 

It 
dQ 
. dt 


= r(l-r 2 ) 

= 1 . 


( 6 ) 


These equations are easy to solve separately, and the general solution of the 
system (6) is found to be 


< r 

e = f+f 0 . 

The corresponding general solution of (2) is 

r _ cos (t + tp) 
\ll + ce~ 2t 

< 

_ sin(f + t 0 ) 
\ll + ce~ 2t 


(7) 


( 8 ) 


Let us analyze (7) geometrically (Figure 94). If c- 0, we have the solutions 
r = l and Q = t + t 0 , which trace out the closed circular path x 2 + y 2 =l in the 
counterclockwise direction. If c < 0, it is clear that r > 1 and that r 1 as 
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FIGURE 94 


t Also, if c > 0, we see that r < 1, and again r -* 1 as f These observa¬ 
tions show that there exists a single closed path (r= 1) which all other paths 
approach spirally from the outside or the inside as t -> 

In the above discussion we have shown that the system (2) has a closed 
path by actually finding such a path. In general, of course, we cannot hope 
to be able to do this. What we need are tests that make it possible for us to 
conclude that certain regions of the phase plane do or do not contain closed 
paths. Our first test is given in the following theorem of Poincare. A proof is 
sketched in Problem 1. 


Theorem A. A closed path of the system (1) necessarily surrounds at least one criti¬ 
cal point of this system. 


This result gives a negative criterion of rather limited value: a system with¬ 
out critical points in a given region cannot have closed paths in that region. 

Our next theorem provides another negative criterion, and is due to 
Bendixson. 14 


14 Ivar Otto Bendixson (1861-1935) was a Swedish mathematician who published one impor¬ 
tant memoir in 1901 supplementing some of Poincare's earlier work. He served as professor 
(and later as president) at the University of Stockholm, and was an energetic long-time mem¬ 
ber of the Stockholm City Council. 
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Theorem B. If dE/dx + dG/dy is always positive or always negative in a certain 
region of the phase plane, then the system (1) cannot have closed paths in that 
region. 

Proof. Assume that the region contains a closed path C = [x{t),y{t)\ with inte¬ 
rior R. Then Green's theorem and our hypothesis yield 




5G" 

3 /, 


dxdy * 0. 


However, along C we have dx=F dt and dy = G dt, so 

T 

J(Fdy-Gdx) = J(FG-GF)df = 0. 

c o 


This contradiction shows that our initial assumption is false, so the region 
under consideration cannot contain any closed path. 


These theorems are sometimes useful, but what we really want are posi¬ 
tive criteria giving sufficient conditions for the existence of closed paths of 
(1). One of the few general theorems of this kind is the classical Poincare- 
Bendixson theorem, which we now state without proof. 15 


Theorem C. Let Rbe a bounded region of the phase plane together with its bound¬ 
ary, and assume that R does not contain any critical points of the system (1). 
If C- [x(f),y(f)] is a path of { 1) that lies in Rfor some t 0 and remains in Rfor all t > t 0 , 
then C is either itself a closed path or it spirals toward a closed path as t °°. Thus 
in either case the system (1) has a closed path in R. 


In order to understand this statement, let us consider the situation suggested 
in Figure 95. Here R consists of the two dashed curves together with the ring- 
shaped region between them. Suppose that the vector 

V(x,y)=F(x,y)i + G(x,y)) 

points into R at every boundary point. Then every path C through a bound¬ 
ary point (at t = t 0 ) must enter R and can never leave it, and under these cir¬ 
cumstances the theorem asserts that C must spiral toward a closed path C 0 . 


15 For details, see Hurewicz, loc. cit., pp. 102-111, or Cesari, loc. cit., pp. 163-167. 
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FIGURE 95 


We have chosen a ring-shaped region R to illustrate the theorem because a 
closed path like C 0 must surround a critical point (P in the figure) and R must 
exclude all critical points. 

The system (2) provides a simple application of these ideas. It is clear that 
(2) has a critical point at (0,0), and also that the region R between the circles 
r = l/2 and r = 2 contains no critical points. In our earlier analysis we found 
that 


— = r(l-r 2 ) forr>0. 


dt 


This shows that dr/dt > 0 on the inner circle and dr/dt < 0 on the outer circle, 
so the vector V points into R at all boundary points. Thus any path through 
a boundary point will enter R and remain in R as t -* °°, and by the Poincare- 
Bendixson theorem we know that R contains a closed path C 0 . We have 
already seen that the circle r = 1 is the closed path whose existence is guar¬ 
anteed in this way. 

The Poincare-Bendixson theorem is quite satisfying from a theoretical 
point of view, but in general it is rather difficult to apply. A more practical 
criterion has been developed that assures the existence of closed paths for 
equations of the form 



(9) 
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which is called Lienard's equation. 16 When we speak of a closed path for such 
an equation, we of course mean a closed path of the equivalent system 


dx 

dt 


= y 




( 10 ) 


and as we know, a closed path of (10) corresponds to a periodic solution of 
(9). The fundamental statement about the closed paths of (9) is the following 
theorem. 


Theorem D. (Lienard's Theorem.) Let the functions f(x) and g(x) satisfy the fol¬ 
lowing conditions: (i) both are continuous and have continuous derivatives for all x; 
(ii) g(x) is an odd function such that g(x) > 0 for x > 0, andf(x) is an even function; 
and (iii) the odd function F(x) = Jo f(x)dx has exactly one positive zero at x = a, is 
negative for 0 <x<a,is positive and nondecreasing for x > a, and F(x) -»• °° as x -»• 
Then equation (9) has a unique closed path surrounding the origin in the phase plane, 
and this path is approached spirally by every other path as t -> 


For the benefit of the skeptical and tenacious reader who is rightly reluc¬ 
tant to accept unsupported assertions, a proof of this theorem is given in 
Appendix B. An intuitive understanding of the role of the hypotheses can 
be gained by thinking of (9) in terms of the ideas of the previous section. 
From this point of view, equation (9) is the equation of motion of a unit mass 
attached to a spring and subject to the dual influence of a restoring force 
-g(x) and a damping force -f(x) dx/dt. The assumption about g(x) amounts 
to saying that the spring acts as we would expect, and tends to diminish 
the magnitude of any displacement. On the other hand, the assumptions 
about/(x)—roughly, that f(x) is negative for small |x| and positive for large 
|x|—mean that the motion is intensified for small |x| and retarded for 
large \x\, and therefore tends to settle down into a steady oscillation. This 
rather peculiar behavior of f(x) can also be expressed by saying that the 
physical system absorbs energy when |x| is small and dissipates it when 
| x | is large. 


16 Alfred Lienard (1869-1958) was a French scientist who spent most of his career teaching 
applied physics at the School of Mines in Paris, of which he became director in 1929. His 
physical research was mainly in the areas of electricity and magnetism, elasticity, and 
hydrodynamics. From time to time he worked on mathematical problems arising from his 
other scientific investigations, and in 1933 was elected president of the French Mathematical 
Society. He was an unassuming bachelor whose life was devoted entirely to his work and his 
students. 
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The main application of Lienard's theorem is to the van der Pol 17 equation 

rl^Y ply 

^4 + p(x 2 -l) —+ x = 0, (11) 

dt 2 at 

where (i is assumed to be a positive constant for physical reasons. Here 
f(x) = |r(x 2 - 1) and g(x)-x, so condition (i) is clearly satisfied. It is equally clear 
that condition (ii) is true. Since 


F(x) = pj^x 3 - *j = ^px(x 2 -3), 

we see that F(x) has a single positive zero at x = -J3, is negative for 0 < x < V3, 
is positive for x > V3, and that F(x) -> °° as x -*■ °°. Finally, F'(x) = p(x 2 - 1) is posi¬ 
tive for x > 1, so F(x) is certainly nondecreasing (in fact, increasing) for x > -J3. 
Accordingly, all the conditions of the theorem are met, and we conclude that 
equation (11) has a unique closed path (periodic solution) that is approached 
spirally (asymptotically) by every other path (nontrivial solution). 


Problems 

1. A proof of Theorem A can be built on the following geometric ideas 
(Figure 96). Let C be a simple closed curve (not necessarily a path) in 
the phase plane, and assume that C does not pass through any critical 
point of the system (1). If P = (x,y) is a point on C, then 

V(x,y)=F(x,y)i + G(x,y)j 

is a nonzero vector, and therefore has a definite direction given by the 
angle 0. If P moves once around C in the counterclockwise direction, 
the angle 0 changes by an amount A0 = 2nn, where n is a positive integer, 
zero, or a negative integer. This integer n is called the index of C. If C 
shrinks continuously to a smaller simple closed curve C 0 without pass¬ 
ing over any critical point, then its index varies continuously; and since 
the index is an integer, it cannot change. 


17 Balthasar van der Pol (1889-1959), a Dutch scientist specializing in the theoretical aspects of 
radioengineering, initiated the study of equation (11) in the 1920s, and thereby stimulated 
Lienard and others to investigate the mathematical theory of self-sustained oscillations in 
nonlinear mechanics. 
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FIGURE 96 


(a) If C is a path of (1), show that its index is 1. 

(b) If C is a path of (1) that contains no critical points, show that a small 
C 0 has index 0, and from this infer Theorem A. 

2. Consider the nonlinear autonomous system 

— = 4x + 4y-x(x * 2 3 + y 2 ) 

< 

^ = -4x + 4 y-y(x 2 + y 2 ). 

[dr 


(a) Transform the system into polar coordinate form. 

(b) Apply the Poincare-Bendixson theorem to show that there is a 
closed path between the circles r = 1 and r- 3. 

(c) Find the general nonconstant solution x = x(t) and y = y(t) of the orig¬ 
inal system, and use this to find a periodic solution corresponding 
to the closed path whose existence was established in (b). 

(d) Sketch the closed path and at least two other paths in the phase 
plane. 

3. Show that the nonlinear autonomous system 


dx 

dt 

dy 

At 


= 3x-j j -xe x2+yl 
= x + 3y-ye x2+ ' j2 


has a periodic solution. 
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4. In each of the following cases use a theorem of this section to determine 
whether or not the given differential equation has a periodic solution: 


(a) 

(b) 

(c) 

(d) 

(e) 


dx ,_ 4 Ax 


dt 

d 2 x 


2 + (5x -9x ) —+ x =0; 


dt 


(x 2 + 1) — + x 5 = 0; 
dt 2 v ' dt 


d 2 x 

dt 2 


-(1 + x 2 ) = 0; 


d 2 x dx f dx . „ , _ 

+ — + — -3x* =0; 


dt 2 dt {dt, 
d 2 x h dx n dx 


dt 2 


dx 

dt 


dt 


5. Show that any differential equation of the form 


d 2 x 


. dx 
dt 


a — 2 - + fr(x -1)— + cx = 0 (a, b, c > 0) 


can be transformed into the van der Pol equation by a change of the 
independent variable. 


65 More about the van der Pol Equation 

First, a bit history. In World War II, in the fall of 1940, Hitler's German Army 
had swept across France and was poised on the coast of the English Channel, 
ready to invade England and complete its conquest of Western Europe. To do 
this they needed control of the air, and their Air Force was ready to attack 
and destroy London and Southeast England. All that stood in their way was 
the British Royal Air Force (R.A.F) and its small number of young fighter 
pilots. But with the help of the newly invented radar to tell them in advance 
where and when the German bombers were coming, the R.A.F. pilots suc¬ 
cessfully fought off the Germans and defeated Hitler's plans for conquest. 
This so-called Battle of Britain was a major turning point of the war, of which 
Winston Churchill said, "Never in the history of human conflict was so much 
owed by so many to so few." 

The detailed connection between radar and the van der Pol equation dis¬ 
cussed in Section 64 can only be understood by a skilled electrical engineer, 
which the present writer is not. However, it turned out that solutions of the 
equation were closely related to increasing difficulties in getting reliable 
radar information back from greater and greater distances. 
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The eminent theoretical physicist/mathematician Freeman Dyson was 
in England at the time, and has some interesting memories of these events. 
Dyson came to America in 1947 and has been a permanent Professor at The 
Institute for Advanced Study in Princeton, New Jersey since 1953. Professor 
Dyson's recollections (1996) are as follows: 

In 1942 when I was a student in Cambridge, I heard a lecture by Mary 
Cartwright about the van der Pol equation. Cartwright had been work¬ 
ing with Littlewood on the solutions of the equation, which describe the 
output of a nonlinear radio amplifier when the input is a pure sine wave. 
The whole development of radar in World War II depended on high-power 
amplifiers, and it was a matter of life and death to have amplifiers that did 
what they were supposed to do. The soldiers were plagued with amplifiers 
that misbehaved and blamed the manufacturers for their erratic behav¬ 
ior. Cartwright and Littlewood discovered that the manufacturers were 
not to blame. The equation itself was to blame. They discovered that as 
you raise the gain of the amplifier, the solutions of the equation become 
more and more irregular. At low power the solution has the same period 
as the input, but as the power increases you see solutions with double the 
period, then with quadruple the period, and finally you have solutions 
that are not periodic at all. Cartwright and Littlewood explored the behav¬ 
ior of solutions in detail and discovered the phenomena that later became 
known as "chaos." They published all this in a paper in the Journal of the 
London Mathematical Society, which appeared in 1945. That was a bad year 
to publish. Paper in England was scarce and few copies of the Journal 
were printed. Mathematicians everywhere were still busy fighting the 
war. The paper attracted no attention. In 1949 Mary Cartwright came to 
Princeton and talked about the work again. Again she attracted no atten¬ 
tion. Littlewood was not helpful. In the foreword to Littlewood's collected 
papers is a description written by Littlewood about his collaboration with 
Cartwright: 

"Two rats fell into a can of milk. After swimming for a time one of them 
realized his hopeless fate and drowned. The other persisted, and at last 
the milk turned to butter and he could get out." 

Littlewood does not say whether the rat who drowned was himself or 
Cartwright. In either case the passage makes clear that Littlewood did not 
understand the importance of the work that he and Cartwright had done. 
Only Cartwright understood it, and she is not a person who likes to blow her 
own trumpet. She put the van der Pol equation to one side and went on to a 
distinguished career in analytic function theory and university administra¬ 
tion. She became President of the London Mathematical Society in 1961, and 
Dame Mary (the female equivalent of a knighthood) in 1969. By that time, 
the phenomena of chaos had been rediscovered. A few years later, they were 
given their modern names. 
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Appendix A. Poincare 

Jules Henri Poincare (1854-1912) was universally recognized at the begin¬ 
ning of the twentieth century as the greatest mathematician of his genera¬ 
tion. He began his academic career at Caen in 1879, but only two years later 
he was appointed to a professorship at the Sorbonne. He remained there 
for the rest of his life, lecturing on a different subject each year. In his lec¬ 
tures—which were edited and published by his students—he treated with 
great originality and mastery of technique virtually all known fields of 
pure and applied mathematics, and many that were not known until he 
discovered them. Altogether he produced more than 30 technical books on 
mathematical physics and celestial mechanics, half a dozen books of a more 
popular nature, and almost 500 research papers on mathematics. He was a 
quick, powerful, and restless thinker, not given to lingering over details, and 
was described by one of his contemporaries as "a conquerer, not a colonist." 
He also had the advantage of a prodigious memory, and habitually did his 
mathematics in his head as he paced back and forth in his study, writing it 
down only after it was complete in his mind. He was elected to the Academy 
of Sciences at the very early age of thirty-two. The academician who pro¬ 
posed him for membership said that "his work is above ordinary praise, 
and reminds us inevitably of what Jacobi wrote of Abel—that he had settled 
questions which, before him, were unimagined." 

Poincare's first great achievement in mathematics was in analysis. He gen¬ 
eralized the idea of the periodicity of a function by creating his theory of auto- 
morphic functions. The elementary trigonometric and exponential functions 
are singly periodic, and the elliptic functions are doubly periodic. Poincare's 
automorphic functions constitute a vast generalization of these, for they are 
invariant under a countably infinite group of linear fractional transforma¬ 
tions and include the rich theory of elliptic functions as a detail. He applied 
them to solve linear differential equations with algebraic coefficients, and 
also showed how they can be used to uniformize algebraic curves, that is, 
to express the coordinates of any point on such a curve by means of single¬ 
valued functions x(t) and y(t) of a single parameter t. In the 1880s and 1890s 
automorphic functions developed into an extensive branch of mathematics, 
involving (in addition to analysis) group theory, number theory, algebraic 
geometry, and non-Euclidean geometry. 

Another focal point of his thought can be found in his researches into celes¬ 
tial mechanics (Les Methodes Nouvelle de la Mecanique Celeste, three volumes, 
1892-1899). In the course of this work he developed his theory of asymptotic 
expansions (which kindled interest in divergent series), studied the stability 
of orbits, and initiated the qualitative theory of nonlinear differential equa¬ 
tions. His celebrated investigations into the evolution of celestial bodies led 
him to study the equilibrium shapes of a rotating mass of fluid held together 
by gravitational attraction, and he discovered the pear-shaped figures that 
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played an important role in the later work of Sir G. H. Darwin (Charles' 
son). 18 In Poincare's summary of these discoveries, he writes: "Let us imag¬ 
ine a rotating fluid body contracting by cooling, but slowly enough to remain 
homogeneous and for the rotation to be the same in all its parts. At first very 
approximately a sphere, the figure of this mass will become an ellipsoid of 
revolution which will flatten more and more, then, at a certain moment, it 
will be transformed into an ellipsoid with three unequal axes. Later, the fig¬ 
ure will cease to be an ellipsoid and will become pear-shaped until at last 
the mass, hollowing out more and more at its 'waist,' will separate into two 
distinct and unequal bodies." These ideas have gained additional interest 
in our own time; for with the aid of artificial satellites, geophysicists have 
recently found that the earth itself is slightly pear-shaped. 

Many of the problems he encountered in this period were the seeds of new 
ways of thinking, which have grown and flourished in twentieth-century 
mathematics. We have already mentioned divergent series and nonlinear dif¬ 
ferential equations. In addition, his attempts to master the qualitative nature 
of curves and surfaces in higher dimensional spaces resulted in his famous 
memoir Analysis situs (1895), which most experts agree marks the beginning 
of the modern era in algebraic topology. Also, in his study of periodic orbits 
he founded the subject of topological (or qualitative) dynamics. The type of 
mathematical problem that arises here is illustrated by a theorem he conjec¬ 
tured in 1912 but did not live to prove: if a one-to-one continuous transfor¬ 
mation carries the ring bounded by two concentric circles into itself in such 
a way as to preserve areas and to move the points of the inner circle clock¬ 
wise and those of the outer circle counterclockwise, then at least two points 
must remain fixed. This theorem has important applications to the classical 
problem of three bodies (and also to the motion of a billiard ball on a convex 
billiard table). A proof was found in 1913 by Birkhoff, a young American 
mathematician. 19 Another remarkable discovery in this field, now known as 
the Poincare recurrence theorem, relates to the long-range behavior of con¬ 
servative dynamical systems. This result seemed to demonstrate the futility 
of contemporary efforts to deduce the second law of thermodynamics from 
classical mechanics, and the ensuing controversy was the historical source 
of modern ergodic theory. 

One of the most striking of Poincare's many contributions to mathemati¬ 
cal physics was his famous paper of 1906 on the dynamics of the electron. 
He had been thinking about the foundations of physics for many years, and 
independently of Einstein had obtained many of the results of the special 
theory of relativity. 20 The main difference was that Einstein's treatment 
was based on elemental ideas relating to light signals, while Poincare's was 


18 See G. H. Darwin, The Tides, chap. XVIII, Houghton Mifflin, Boston, 1899. 

19 See G. D. Birkhoff, Dynamical Systems, chap. VI, American Mathematical Society Colloquium 
Publications, vol. IX, Providence, R.I., 1927. 

20 A discussion of the historical background is given by Charles Scribner, Jr., "Henri Poincare 
and the Principle of Relativity," Am. J. Phys., vol. 32, p. 672 (1964). 
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founded on the theory of electromagnetism and was therefore limited in its 
applicability to phenomena associated with this theory. Poincare had a high 
regard for Einstein's abilities, and in 1911 recommended him for his first aca¬ 
demic position. 21 

In 1902 he turned as a side interest to writing and lecturing for a wider 
public, in an effort to share with nonspecialists his enthusiasm for the mean¬ 
ing and human importance of mathematics and science. These lighter works 
have been collected in four books. La Science et VHypothese (1903), La Valeur 
de la Science (1904), Science et Methode (1908), and Dernieres Pensees (1913). 22 
They are clear, witty, profound, and altogether delightful, and show him to 
be a master of French prose at its best. In the most famous of these essays, 
the one on mathematical discovery, he looked into himself and analyzed his 
own mental processes, and in so doing provided the rest of us with some 
rare glimpses into the mind of a genius at work. As Jourdain wrote in his 
obituary, "One of the many reasons for which he will live is that he made it 
possible for us to understand him as well as to admire him." 

At the present time mathematical knowledge is said to be doubling every 
10 years or so, though some remain skeptical about the permanent value 
of this accumulation. It is generally believed to be impossible now for any 
human being to understand thoroughly more than one or two of the four 
main subdivisions of mathematics—analysis, algebra, geometry, and num¬ 
ber theory—to say nothing of mathematical physics as well. Poincare had 
creative command of the whole of mathematics as it existed in his day, and 
he was probably the last man who will ever be in this position. 


Appendix B. Proof of Lienard's Theorem 

Consider Lienard's equation 

£+/w£+*w-o, a) 

and assume that/(x) and g(x) satisfy the following conditions: (i) fix) and g(x) 
are continuous and have continuous derivatives; (ii) g(x) is an odd function 
such that g(x) > 0 for x > 0, and f(x) is an even function; and (iii) the odd 
function F(x) = f(x)dx has exactly one positive zero at x-a, is negative for 
0 < x < a, is positive and nondecreasing for x > a, and F(x) -> °° as x -*■ °°. 
We shall prove that equation (1) has a unique closed path surrounding the 


21 See M. Lincoln Schuster (ed.), A Treasury of the World's Great Letters, p. 453, Simon and 
Schuster, New York, 1940. 

22 All have been published in English translation by Dover Publications, New York. 
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origin in the phase plane, and that this path is approached spirally by every 
other path as t -*■ 

The system equivalent to (1) in the phase plane is 

dx 

■ f ( 2 ) 

j t =-g(x)-f(x)y. 

By condition (i), the basic theorem on the existence and uniqueness of solu¬ 
tions holds. It follows from condition (ii) that y(0) = 0 and g{x) # 0 for x # 0, so 
the origin is the only critical point. Also, we know that any closed path must 
surround the origin. The fact that 


d 2 x 

dt 2 


+ /(*) 


dx 

dt 


d_ 

dt 


dx 
— + 


dt 


j ’ f(x)dx 
o 




suggests introducing a new variable, 

z = y + F(x). 

With this notation, equation (1) is equivalent to the system 

' dx 


= z-F(x) 

= ~g(x) 


(3) 


in the xz-plane. Again we see that the existence and uniqueness theorem 
holds, that the origin is the only critical point, and that any closed path must 
surround the origin. The one-to-one correspondence ( x,y ) <-» (x,z) between 
the points of the two planes is continuous both ways, so closed paths corre¬ 
spond to closed paths and the configurations of the paths in the two planes 
are qualitatively similar. The differential equation of the paths of (3) is 

rfz _ -g( X ) (M 

dx z - F(x) 


These paths are easier to analyze than their corresponding paths in the phase 
plane, for the following reasons. 
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First, since both g(x) and F(x) are odd, equations (3) and (4) are unchanged 
when x and z are replaced by -x and -z. This means that any curve sym¬ 
metric to a path with respect to the origin is also a path. Thus if we know the 
paths in the right half-plane (x > 0), those in the left half-plane (x < 0) can be 
obtained at once by reflection through the origin. 

Second, equation (4) shows that the paths become horizontal only as they 
cross the z-axis, and become vertical only as they cross the curve z = F(x). 
Also, an inspection of the signs of the right sides of equations (3) shows 
that all paths are directed to the right above the curve z = F(x) and to the 
left below this curve, and move downward or upward according as x > 0 or 
x < 0. These remarks mean that the curve z = F(x), the z-axis, and the verti¬ 
cal line through any point Q on the right half of the curve z = F(x) can be 
crossed only in the directions indicated by the arrows in Figure 97. Suppose 
that the solution of (3) defining the path C through Q is so chosen that the 
point Q corresponds to the value t = 0 of the parameter. Then as t increases 
into positive values, a point on C with coordinates x(t) and y(t) moves down 
and to the left until it crosses the z-axis at a point R ; and as t decreases into 
negative values, the point on C rises to the left until it crosses the z-axis at 
a point P. It will be convenient to let b be the abscissa of Q and to denote 
the path C by C b . 

It is easy to see from the symmetry property that when the path C b is con¬ 
tinued beyond P and R into the left half of the plane, the result will be a 
closed path if and only if the distances OP and OR are equal. To show that 



FIGURE 97 
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there is a unique closed path, it therefore suffices to show that there is a 
unique value of b with the property that OP = OR. 

To prove this, we introduce 


G(x) = j ’ g(x)dx 
o 


and consider the function 


E(x,z) = ^-z 2 + G(x), 


which reduces to z 2 /2 on the z-axis. Along any path we have 


dE , ,dx dz 

dt s dt dt 

r j.i dz dz 

-[Z-FMI- + Z- 

p/ \ dz 


SO 


dE = F dz. 


If we compute the line integral of F dz along the path C b from P to R, we 
obtain 


1(b) = jVdz = jdE = E R -E P = ^(OR 2 -OP 2 ), 

PR PR 

so it suffices to show that there is a unique b such that 1(b) = 0. 

If b < a, then F and dz are negative, so 1(b) > 0 and C b cannot be closed. 
Suppose now that b> a, as in Figure 97. We split 1(b) into two parts, 

Ii(b)= J Fdz + J Fdz and I 2 (b)=j*Fdz, 

PS TR ST 


so that 


m=m+i 2 (b)- 
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Since F and dz are negative as C h is traversed from P to S and from T to R, it is 
clear that Ifb) > 0. On the other hand, if we go from S to T along C h we have 
F > 0 and dz < 0, so J 2 (b) < 0. Our immediate purpose is to show that 1(b) is 
a decreasing function of b by separately considering Ifb) and I 2 (b). First, we 
note that equation (4) enables us to write 

Fdz = F^dx= -Z mx) dx. 
dx z-F(x) 

The effect of increasing b is to raise the arc PS and to lower the arc TR, which 
decreases the magnitude of [~g(x)F(x)]/[z - F(x)] for a given x between 0 and a. 
Since the limits of integration for Ifb ) are fixed, the result is a decrease in Ifb). 
Furthermore, since F(x) is positive and nondecreasing to the right of a, we see 
that an increase in b gives rise to an increase in the positive number -I 2 (b), 
and hence to a decrease in I 2 (b). Thus 1(b) = Ifb) + I 2 (b) is a decreasing function 
for b>a. We now show that I 2 (b) -> as b -> If L in Figure 97 is fixed and 
K is to the right of L, then 

I 2 (b)= hdz< f / dz < -(LM) ■ (LN) ; 

ST NK 

and since LN -> °° as b -> °°, we have I 2 (b) -> 

Accordingly, 1(b) is a decreasing continuous function of b for b > a, 1(a) > 0, 
and 1(b) -> as b It follows that 1(b) = 0 for one and only one b = b 0/ so 
there is one and only one closed path C bo . 

Finally, we observe that OR > OP for b < b 0 ; and from this and the symme¬ 
try we conclude that paths inside C bo spiral out to C bo . Similarly, the fact that 
OR < OP for b > b 0 implies that paths outside C bo spiral in to C bo . 
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The Calculus of Variations 


66 Introduction. Some Typical Problems of the Subject 

The calculus of variations has been one of the major branches of analysis for 
more than two centuries. It is a tool of great power that can be applied to a 
wide variety of problems in pure mathematics. It can also be used to express 
the basic principles of mathematical physics in forms of the utmost simplic¬ 
ity and elegance. 

The flavor of the subject is easy to grasp by considering a few of its typical 
problems. Suppose that two points P and Q are given in a plane (Figure 98). 
There are infinitely many curves joining these points, and we can ask which 
of these curves is the shortest. The intuitive answer is of course a straight line. 
We can also ask which curve will generate the surface of revolution of smallest 
area when revolved about the x-axis, and in this case the answer is far from 
clear. If we think of a typical curve as a frictionless wire in a vertical plane, 
then another nontrivial problem is that of finding the curve down which a 
bead will slide from P to Q in the shortest time. This is the famous brachisto¬ 
chrone problem of John Bernoulli, which we discussed in Section 6. Intuitive 
answers to such questions are quite rare, and the calculus of variations pro¬ 
vides a uniform analytical method for dealing with situations of this kind. 

Every student of elementary calculus is familiar with the problem of find¬ 
ing points at which a function of a single variable has maximum or mini¬ 
mum values. The above problems show that in the calculus of variations 
we consider some quantity (arc length, surface area, time of descent) that 
depends on an entire curve, and we seek the curve that minimizes the quan¬ 
tity in question. The calculus of variations also deals with minimum prob¬ 
lems depending on surfaces. For example, if a circular wire is bent in any 
manner and dipped into a soap solution, then the soap film spanning the 
wire will assume the shape of the surface of smallest area bounded by the 
wire. The mathematical problem is to find the surface from this minimum 
property and the known shape of the wire. 

In addition, the calculus of variations has played an important role as a 
unifying influence in mechanics and as a guide in the mathematical inter¬ 
pretation of many physical phenomena. For instance, it has been found that if 
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FIGURE 98 


the configuration of a system of moving particles is governed by their mutual 
gravitational attractions, then their actual paths will be minimizing curves 
for the integral, with respect to time, of the difference between the kinetic 
and potential energies of the system. This far-reaching statement of classical 
mechanics is known as Hamilton's principle after its discoverer. Also, in mod¬ 
ern physics, Einstein made extensive use of the calculus of variations in his 
work on general relativity, and Schrodinger used it to discover his famous 
wave equation, which is one of the cornerstones of quantum mechanics. 

A few of the problems of the calculus of variations are very old, and were 
considered and partly solved by the ancient Greeks. The invention of ordi¬ 
nary calculus by Newton and Leibniz stimulated the study of a number of 
variational problems, and some of these were solved by ingenious special 
methods. However, the subject was launched as a coherent branch of analy¬ 
sis by Euler in 1744, with his discovery of the basic differential equation for 
a minimizing curve. 

We shall discuss Euler's equation in the next section, but first we observe 
that each of the problems described in the second paragraph of this section 
is a special case of the following more general problem. Let P and Q have 
coordinates (x y t/,) and (x 2 , y 2 ), and consider the family of functions 


y = y(*) 


(i) 


that satisfy the boundary conditions y(x 1 ) = y and i/(x 2 ) = y 2 —that is, the graph 
of (1) must join P and Q. Then we wish to find the function in this family that 
minimizes an integral of the form 



( 2 ) 
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To see that this problem indeed contains the others, we note that the length 
of the curve (1) is 


Ja/i Hyfdx, 


X\ 


(3) 


and that the area of the surface of revolution obtained by revolving it about 
the x-axis is 


*2 

j*2 mjyjl + iy') 2 

X\ 


dx. 


(4) 


In the case of the curve of quickest descent, it is convenient to invert the 
coordinate system and take the point P at the origin, as in Figure 99. Since the 
speed v - ds/dt is given by v = ^J2gy , the total time of descent is the integral 
of ds/v and the integral to be minimized is 


1 




(5) 


Accordingly, the function/(x, y, y') occurring in (2) has the respective forms 
^/l + (y') 2 , 2m/^l + (y') 2 and ^jl + (y') 2 j\j2gy in our three problems. 

It is necessary to be somewhat more precise in formulating the basic prob¬ 
lem of minimizing the integral (2). First, we will always assume that the 
function f(x,y,y') has continuous partial derivatives of the second order with 
respect to x, y, and y'. The next question is. What types of functions (1) are 



FIGURE 99 
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to be allowed? The integral (2) is a well-defined real number whenever the 
integrand is continuous as a function of x, and for this it suffices to assume 
that y'(x) is continuous. However, in order to guarantee the validity of the 
operations we will want to perform, it is convenient to restrict ourselves once 
and for all to considering only unknown functions y(x) that have continuous 
second derivatives and satisfy the given boundary conditions y(x j) = y 1 and 
y(x 2 ) = y 2 . Functions of this kind will be called admissible. We can imagine a 
competition which only admissible functions are allowed to enter, and the 
problem is to select from this family the function or functions that yield the 
smallest value for I. 

In spite of these remarks, we will not be seriously concerned with issues 
of mathematical rigor. Our point of view is deliberately naive, and our sole 
purpose is to reach the interesting applications as quickly and simply as pos¬ 
sible. The reader who wishes to explore the very extensive theory of the sub¬ 
ject can readily do so in the systematic treatises. 1 


67 Euler's Differential Equation for an Extremal 

Assuming that there exists an admissible function y(x) that minimizes the 
integral 


*2 

i = j f(x,y,y')dx, 

*i 


(i) 


how do we find this function? We shall obtain a differential equation for 
y(x) by comparing the values of I that correspond to neighboring admissible 
functions. The central idea is that since y(x) gives a minimum value to I, I 
will increase if we "disturb" y(x) slightly. These disturbed functions are con¬ 
structed as follows. 

Let r|(x) be any function with the properties that q"(x) is continuous and 


n(*i) = nte) = 0. ( 2 ) 

If a is a small parameter, then 

y(x) = y(x) + ar\{x) (3) 


1 See, for example, I. M. Gelfand and S. V. Fomin, Catculus of Variations, Prentice-Hall, 
Englewood Cliffs, N.J., 1963; G. M. Ewing, Calculus of Variations with Applications, Norton, 
New York, 1969; or C. Caratheodory, Calculus of Variations and Partial Differential Equations of 
the First Order, Part IP Calculus of Variations, Holden-Day, San Francisco, 1967. 
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FIGURE 100 

represents a one-parameter family of admissible functions. The vertical 
deviation of a curve in this family from the minimizing curve y(x) is ar|(x), 
as shown in Figure 100. 2 The significance of (3) lies in the fact that for each 
family of this type, that is, for each choice of the function q(x), the minimiz¬ 
ing function y(x) belongs to the family and corresponds to the value of the 
parameter a = 0. 

Now, with r|(x) fixed, we substitute y(x) = y(x) + ar|(x)andy'(x) = y'(x) + ar)'(x) 
into the integral (1), and get a function of a. 




( 4 ) 


When a = 0, formula (3) yields y(x) = y(x); and since y(x) minimizes the inte¬ 
gral, we know that 1(a) must have a minimum when a = 0. By elementary 
calculus, a necessary condition for this is the vanishing of the derivative I'(d) 


2 The difference y — y = ar) is called the variation of the function y and is usually denoted by 
6 y. This notation can be developed into a useful formalism (which we do not discuss) and is 
the source of the name calculus of variations. 
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when a = 0: I'{ 0) = 0. The derivative I'(a) can be computed by differentiating 
(4) under the integral sign, that is. 


r a 

= ] da f( x AJ,y') dx - 


( 5 ) 


By the chain rule for differentiating functions of several variables, we have 

d _ df dx df dy df du' 

da J dx da dy da dy’ da 

= |ph(*) +Jzrh'M, 
dy dy 

so (5) can be written as 





dx. 


Now J'(0) = 0, so putting a = 0 in (6) yields 



dx = 0. 


( 6 ) 


( 7 ) 


In this equation the derivative r|'(x) appears along with the function q(x). We 
can eliminate q'( x ) by integrating the second term by parts, which gives 


]^ x)dx ' 


r|(x) 


df 


dy’ 

x 2 

Jri(x) 


X2 

JhW 


d 

dx 


f df' 
dy’ 


dx 


A A 

dx{ dy’j 


dx 


by virtue of (2). We can therefore write (7) in the form 

X2 

JhW 


df df df' 

Wj 


dy dx 


dx = 0. 


( 8 ) 
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Our reasoning up to this point is based on a fixed choice of the function q(x). 
However, since the integral in (8) must vanish for every such function, we at 
once conclude that the expression in brackets must also vanish. This yields 


d_( df' 
dx{dyf 



( 9 ) 


which is Euler's equation. 3 

It is important to have a clear understanding of the exact nature of our 
conclusion: namely, if y(x) is an admissible function that minimizes the inte¬ 
gral (1), then y satisfies Euler's equation. Suppose an admissible function y 
can be found that satisfies this equation. Does this mean that y minimizes I ? 
Not necessarily. The situation is similar to that in elementary calculus, where 
a function g(x) whose derivative is zero at a point x 0 may have a maximum, 
a minimum, or a point of inflection at x 0 . When no distinctions are made, 
these cases are often called stationary values of g(x), and the points x 0 at which 
they occur are stationary points. In the same way, the condition T(0) = 0 can 
perfectly well indicate a maximum or point of inflection for 1(a) at a = 0, 
instead of a minimum. Thus it is customary to call any admissible solution 
of Euler's equation a stationary function or stationary curve, and to refer to the 
corresponding value of the integral (1) as a stationary value of this integral— 
without committing ourselves as to which of the several possibilities actually 
occurs. Furthermore, solutions of Euler's equation which are unrestricted by 
the boundary conditions are called extremals. 

In calculus we use the second derivative to give sufficient conditions dis¬ 
tinguishing one type of stationary value from another. Similar sufficient 
conditions are available in the calculus of variations, but since these are quite 
complicated, we will not consider them here. In actual practice, the geometry 
or physics of the problem under discussion often makes it possible to deter¬ 
mine whether a particular stationary function maximizes or minimizes the 
integral (or neither). The reader who is interested in sufficient conditions and 
other theoretical problems will find adequate discussions in the books men¬ 
tioned in Section 66. 

As it stands, Euler's equation (9) is not very illuminating. In order to inter¬ 
pret it and convert it into a useful tool, we begin by emphasizing that the 
partial derivatives df/dy and df/dy' are computed by treating x, y, and y’ as 
independent variables. In general, however, df/dy’ is a function of x explicitly. 


3 In more detail, the indirect argument leading to (9) is as follows. Assume that the bracketed 
function in (8) is not zero (say, positive) at some point x = a in the interval. Since this function 
is continuous, it will be positive throughout some subinterval about x = a. Choose an r|(x) 
that is positive inside the subinterval and zero outside. For this r|(x), the integral in (8) will 
be positive—which is a contradiction. When this argument is formalized, the resulting state¬ 
ment is known as the fundamental lemma of the calculus of variations. 
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and also implicitly through y and y', so the first term in (9) can be written in 
the expanded form 


d 

f 

d 

-f- 

f df y 

dy i d 


dx 

W) 


Wj 

dx dy' 

w) 


Accordingly, Euler's equation is 

/yV 0 +/yV £ +(/ **- f y) = Q - (10) 

This equation is of the second order unless / , , = 0, so in general the 
extremals—its solutions—constitute a two-parameter family of curves; and 
among these, the stationary functions are those in which the two parameters 
are chosen to fit the given boundary conditions. A second order nonlinear 
equation like (10) is usually impossible to solve, but fortunately many appli¬ 
cations lead to special cases that can be solved. 

CASE A. If x and y are missing from the function/ then Euler's equation 
reduces to 


dhy 

hV dx 2 


= 0 ; 


and if /* 0, we have d 2 y/dx 2 = 0 and y = c,x + c 2 , so the extremals are all 
straight lines. 

CASE B. If y is missing from the function / then Euler's equation becomes 


dJdf^ 

dx{dy' y 


= 0, 


and this can be integrated at once to yield the first order equation 


dL 

dy' 


= Ci 


for the extremals. 

CASE C. If x is missing from the function/ then Euler's equation can be 
integrated to 
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df ’ ( 

1/ - f = Ci. 


This follows from the identity 


d_ 

dx 


dij 


iV-f 


= y 


!_ 

dx 


f df' 
kWj 


dy 


8f_ 

dx' 


since df/dx = 0 and the expression in brackets on the right is zero by Euler's 
equation. 

We now apply this machinery to the three problems formulated in 
Section 66. 


Example 1. To find the shortest curve joining two points (x u yd and 
[x 2 ,yd —which we know intuitively to be a straight line—we must mini¬ 
mize the arc length integral 


1 = 




dx- 


The variables x and y are missing from f(y') = Jl + (y') 2 , so this problem 
falls under Case A. Since 


fy'V 


_ 8 2 f 


dy' 2 [i + (y') 2 ] 3/2 


* 0 , 


Case A tells us that the extremals are the two-parameter family of 
straight lines y - c-yX + c 2 . The boundary conditions yield 


y-y 1 = M!(*-*) 

X2-X1 


( 11 ) 


as the stationary curve, and this is of course the straight line joining 
the two points. It should be noted that this analysis shows only that if 
I has a stationary value, then the corresponding stationary curve must 
be the straight line (11). However, it is clear from the geometry that I 
has no maximizing curve but does have a minimizing curve, so we 
conclude in this way that (11) actually is the shortest curve joining our 
two points. 

In this example we arrived at an obvious conclusion by analytical means. 
A much more difficult and interesting problem is that of finding the shortest 
curve joining two fixed points on a given surface and lying entirely on that 
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surface. These curves are called geodesics, and the study of their properties 
is one of the focal points of the branch of mathematics known as differential 
geometry. 


Example 2. To find the curve joining the points (x y yd and ( x 2 , yd that 
yields a surface of revolution of minimum area when revolved about the 
x-axis, we must minimize 


I = 



( 12 ) 


The variable x is missing from f{y,y') = 2ra/^l + (y') 2 , so Case C tells us 
that Euler's equation becomes 


yiy'f 

V 1 + (y ') 2 


y^+Wf = cv 


which simplifies to 


qy 


'=Vy 5 


-Cl . 


On separating variables and integrating, we get 


x 



Cl log 


y + 


l 


-Cl 


+ C2, 


and solving for y gives 


y = Ci cosh 


x-c 2 


(13) 


The extremals are therefore catenaries, and the required minimal sur¬ 
face—if it exists—must be obtained by revolving a catenary. The next 
problem is that of seeing whether the parameters c t and c 2 can indeed be 
chosen so that the curve (13) joins the points (x lf yd and (x 2/ yd- 
The choosing of these parameters turns out to be curiously compli¬ 
cated. If the curve (13) is made to pass through the first point (x v yd, then 
one parameter is left free. Two members of this one-parameter family 
are shown in Figure 101. It can be proved that all such curves are tan¬ 
gent to the dashed curve C, so no curve in the family crosses C. Thus, 
when the second point (x 2 , yd is below C, as in Figure 101, there is no 
catenary through both points and no stationary function exists. In this 
case it is found that smaller and smaller surfaces are generated by curves 
that approach the dashed line from (x lr yd to (x 2 , 0) to (x 2 , 0) to (x 2 , yd 
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FIGURE 101 


so no admissible curve can generate a minimal surface. When the sec¬ 
ond point lies above C, there are two catenaries through the points, and 
hence two stationary functions, but only the upper catenary generates a 
minimal surface. Finally, when the second point is on C, there is only one 
stationary function but the surface it generates is not minimal. 4 


Example 3. To find the curve of quickest descent in Figure 99, we must 
minimize 


1 = 


I 


V*+(y ') 2 


dx. 


Again the variable x is missing from the function f(y, y') = J 1 + (y') 2 / f 2gy, 
so by Case C, Euler's equation becomes 

(y'f V 1 +(y') 2 _ Ci 
x/y 


4 A full discussion of these statements, with proofs, can be found in Chapter IV of G. A. Bliss's 
book Calculus of Variations, Carus Monograph no. 1, Mathematical Association of America, 
1925. 
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This reduces to 


y[ 1 + (y') 2 l = c. 


which is precisely the differential equation 6-(4) arrived at in our earlier 
discussion of this famous problem. Its solution is given in Section 6. The 
resulting stationary curve is the cycloid 


x = a(Q - sin 0) and y = a(l - cos 0) 


(14) 


generated by a circle of radius a rolling under the x axis, where a is 
chosen so that the first inverted arch passes through the point (x 2 , y 2 ) 
in Figure 99. As before, this argument shows only that if I has a mini¬ 
mum, then the corresponding stationary curve must be the cycloid (14). 
However, it is reasonably clear from physical considerations that I has 
no maximizing curve but does have a minimizing curve, so this cycloid 
actually minimizes the time of descent. 

We conclude this section with an easy but important extension of our treat¬ 
ment of the integral (1). This integral represents variational problems of the 
simplest type because it involves only one unknown function. However, 
some of the situations we will encounter below are not quite so simple, for 
they lead to integrals depending on two or more unknown functions. 

For example, suppose we want to find conditions necessarily satisfied by 
two functions y(x) and z(x) that give a stationary value to the integral 



(15) 


where the boundary values y(x,), z(x 1 ) and y(x 2 ), z(x 2 ) are specified in advance. 
Just as before, we introduce functions q^x) and q 2 (x) that have continuous 
second derivatives and vanish at the endpoints. From these we form the 
neighboring functions y(x) = y(x) + aqi(x) and z(x) = z(x) + aq 2 (x), and then 
consider the function of a defined by 


1(a) = f(x, y + ar|i z + aq 2 ,y' + aqi, z' + aq' 2 )dx- 


(16) 


Again, if y(x) and z(x) are stationary functions we must have f'(0) = 0, so by 
computing the derivative of (16) and putting a = 0 we get 
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or, if the terms involving r|i and v[ 2 are integrated by parts. 


JU> 


df__d_(cf^ 
dy dx{dy ', 


+Mx) 


df d ( df 
dz dx { dz' 


> dx = 0. 


(17) 


Finally, since (17) must hold for all choices of the functions Tp(x) and r| 2 (x), we 
are led at once to Euler's equations 


d_ 

dx 


dy’ 


- — = 0 and 

dy 




dx I dz’ J dz 


= 0 . 


(18) 


Thus, to find the extremals of our problem, we must solve the system (18). 
Needless to say, a system of intractable equations is harder to solve than only 
one; but if (18) can be solved, then the stationary functions are determined by 
fitting the resulting solutions to the given boundary conditions. Similar con¬ 
siderations apply without any essential change to integrals like (15) which 
involve more than two unknown functions. 


Problems 

1. Find the extremals for the integral (1) if the integrand is 

(a) yji + (y') 2 ; 

y 

(b) y 2 - (y') 2 . 

2. Find the stationary function of 

4 

j[xy'-(yf]dx 

0 

which is determined by the boundary conditions t/(0) = 0 and y(4) = 3. 

3. When the integrand in (1) is of the form 

a(x)(y') 2 + 2 h(x)yy' + c(x)y 2 , 

show that Euler's equation is a second order linear differential equation. 

4. If P and Q are two points in a plane, then in terms of polar coordinates, 
the length of a curve from P to Q is 
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Find the polar equation of a straight line by minimizing this integral 

(a) with 0 as the independent variable; 

(b) with r as the independent variable. 

5. Consider two points P and Q on the surface of the sphere x 2 + y 2 + 
z 2 = a 2 , and coordinatize this surface by means of the spherical coordi¬ 
nates 0 and cf>, where x = a sin<|> cos 0 ,y = a sin <|> sin 0, and z-a cos <|). 
Let 0 = P(t)>) be a curve lying on the surface and joining P and Q. Show 
that the shortest such curve (a geodesic) is an arc of a great circle, that 
is, that it lies on a plane through the center. Hint: Express the length of 
the curve in the form 


Q Q 



P P 



solve the corresponding Euler equation for 0, and convert the result 
back into rectangular coordinates. 

6. Prove that any geodesic on the right circular cone z 2 = a 2 (x 2 + if), z > 0, 
has the following property: If the cone is cut along a generator and 
flattened into a plane, then the geodesic becomes a straight line. Hint: 
Represent the cone parametrically by means of the equations 



show that the parameters r and 0 represent ordinary polar coordinates 
on the flattened cone; and show that a geodesic r = r(0) is a straight line 
in these polar coordinates. 

7. If the curve y = g(z) is revolved about the z-axis, then the resulting sur¬ 
face of revolution has x 2 + y 2 - g(z) 2 as its equation. A convenient para¬ 
metric representation of this surface is given by 


x = g(z) cos 0, y = g(z) sin 0, z = z. 


where 0 is the polar angle in the xy-plane. Show that a geodesic 0 = 0(z) 
on this surface has 



as its equation. 

8. If the surface of revolution in Problem 7 is a right circular cylinder, 
show that every geodesic of the form 0 = 0(z) is a helix or a generator. 
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68 Isoperimetric Problems 

The ancient Greeks proposed the problem of finding the closed plane curve 
of given length that encloses the largest area. They called this the isoperimet¬ 
ric problem, and were able to show in a more or less rigorous manner that 
the obvious answer—a circle—is correct. 5 If the curve is expressed para¬ 
metrically by x = x(f) and y = i/(f), and is traversed once counterclockwise as t 
increases from f to t 2 , then the enclosed area is known to be 



which is an integral depending on two unknown functions. 6 Since the length 
of the curve is 



the problem is to maximize (1) subject to the side condition that (2) must 
have a constant value. The term isoperimetric problem is usually extended to 
include the general case of finding extremals for one integral subject to any 
constraint requiring a second integral to take on a prescribed value. 

We will also consider finite side conditions, which do not involve integrals 
or derivatives. For example, if 


G(x, y, z) = 0 (3) 

is a given surface, then a curve on this surface is determined parametrically 
by three functions x = x(f), y = y(t), and z = z(f) that satisfy equation (3), and 
the problem of finding geodesics amounts to the problem of minimizing the 
arc length integral 



subject to the side condition (3). 


5 See B. L. van der Waerden, Science Awakening, pp. 268-269, Oxford University Press, London, 
1961; also, G. Polya, Induction and Analogy in Mathematics, Chapter 10, Princeton University 
Press, Princeton, N.J., 1954. 

6 Formula (1) is a special case of Green's theorem. Also, see Problem 1. 
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Lagrange multipliers. It is necessary to begin by considering some problems 
in elementary calculus that are quite similar to isoperimetric problems. For 
example, suppose we want to find the points (x, y) that yield stationary val¬ 
ues for a function z = / (x, y), where, however, the variables x and y are not 
independent but are constrained by a side condition 


g(x, y) = 0. (5) 

The usual procedure is to arbitrarily designate one of the variables x and y in 
(5) as independent, say x, and the other as dependent on it, so that dy/dx can 
be computed from 


dg + dgdi = 0 

dx dy dx 


We next use the fact that since z is now a function of x alone, dz/dx = 0 is a 
necessary condition for z to have a stationary value, so 


dx dx dy dx 


or 


df df dg/dx _ 
dx dy dg/dy 


On solving (5) and (6) simultaneously, we obtain the required points (x, y)7 
One drawback to this approach is that the variables x and y occur sym¬ 
metrically but are treated unsymmetrically. It is possible to solve the same 
problem by a different and more elegant method that also has many practical 
advantages. We form the function 

F(x, y, X) =/(x, y) + Xg(x, y) 

and investigate its unconstrained stationary values by means of the necessary 
conditions 


7 In very simple cases, of course, we can solve (5) for y as a function of x and insert this in 2 = 
f(x,y), which gives z as an explicit function of x; and all that remains is to compute dz/dx, solve 
the equation dz/dx = 0, and find the corresponding i/'s. 
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aF = £ +Jt & = 0/ 

dx dx dx 

^ = o, P) 

dy dy dy 

dF 

a = gfe!,) ' 0 - 

If X is eliminated from the first two of these equations, then the system clearly 
reduces to 


d f dg/dx Q 
dx dy dg/dy 


and g{x r y) = 0. 


and this is the system obtained in the above paragraph. It should be 
observed that this technique (solving the system (7) for x and y) solves the 
given problem in a way that has two major features important for theoreti¬ 
cal work: it does not disturb the symmetry of the problem by making an 
arbitrary choice of the independent variable; and it removes the side condi¬ 
tion at the small expense of introducing X as another variable. The param¬ 
eter X is called a Lagrange multiplier, and this method is known as the method 
of Lagrange multipliers. 8 This discussion extends in an obvious manner to 
problems involving functions of more than two variables with several side 
conditions. 

Integral side conditions. Here we want to find the differential equation 
that must be satisfied by a function y(x) that gives a stationary value to the 
integral 


x 2 

I = \f(x,y,y')dx, 

xi 


where y is subject to the side condition 


( 8 ) 


X2 

J = jg(x,y,y')dx,=c 

XI 


( 9 ) 


and assumes prescribed values t/(x,) = y 1 and y(x 2 ) = y 2 at the end-points. As 
before, we assume that y(x) is the actual stationary function and disturb it 


A brief account of Lagrange is given in Appendix A. 
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slightly to find the desired analytic condition. However, this problem cannot 
be attacked by our earlier method of considering neighboring functions of 
the form y(x) = y(x) + ar|(x), for in general these will not maintain the second 
integral / at the constant value c Instead, we consider a two-parameter family 
of neighboring functions, 

y(x) = y(x) + airii(x) + a 2 r| 2 (x), (10) 

where rp(x) and r| 2 (x) have continuous second derivatives and vanish at the 
endpoints. The parameters ex, and a 2 are not independent, but are related by 
the condition that 


*2 

/(ai,a 2 ) = | g(x,y,y')dx = c. 


( 11 ) 


Our problem is then reduced to that of finding necessary conditions for the 
function 


X2 

I(ai,a 2 ) = J* f(x,y,y')dx 


( 12 ) 


to have a stationary value at ocj = a 2 = 0, where a, and a 2 satisfy (11) This situ¬ 
ation is made to order for the method of Lagrange multipliers We therefore 
introduce the function 


K(a 1/ a 2 ,k) = I(ai,a 2 ) + k/(ai,a 2 ) 

*2 

= j* F{x,y ,y')dx, (13) 

*1 


where 


F=f+^g’ 


and investigate its unconstrained stationary value at oij = a 2 = 0 by means of 
the necessary conditions 

dK dK o u n 

-=-= 0 when ai = a 2 = 0. 

5ai 5a 2 


( 14 ) 
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If we differentiate (13) 

dK 
da i 

and setting otj = a 2 = 0 


under the integral sign and use (10), we get 


x 2 

1 


dF dF 

— T]i(x) + —T\i(x) 

dy dy 


dx for i = 1,2; 


yields 



xi 


h i(x) + 


dF 


dy' 


7 n'i(x) 


dx = 0 


by virtue of (14). After the second term is integrated by parts, this becomes 



d_f dF^ 
dx{dyf 


dx = 0. 


(15) 


Since p,(x) and r\ z (x) are both arbitrary, the two conditions embodied in (15) 
amount to only one condition, and as usual we conclude that the stationary 
function y(x) must satisfy Euler's equation 


d_(dF ^ 
dx[dyf 



(16) 


The solutions of this equation (the extremals of our problem) involve three 
undetermined parameters: two constants of integration, and the Lagrange 
multiplier X. The stationary function is then selected from these extremals 
by imposing the two boundary conditions and giving the integral / its pre¬ 
scribed value c. 

In the case of integrals that depend on two or more functions, this result 
can be extended in the same way as in the previous section. For example, if 


1 = 


*2 

| f(x,y,z,y',z')dx 


XI 


has a stationary value subject to the side condition 
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then the stationary functions y(x) and z(x) must satisfy the system of equations 

= 0, (17) 


d_ 

dx 



-- F =0 and 


' dF 

1%'J 

dy 

dx 

,&'J 


dz 


where F =f+Xg. The reasoning is similar to that already given, and we omit 
the details. 


Example 1. We shall find the curve of fixed length L that joins the points 
(0,0) and (1,0), lies above the x-axis, and encloses the maximum area 
between itself and the x-axis. This is a restricted version of the original 
isoperimetric problem in which part of the curve surrounding the area 
to be maximized is required to be a line segment of length 1. Our prob¬ 
lem is to maximize y dx subject to the side condition 
Jo 

i 

J sfi~+Wfdx = L 

0 


and the boundary conditions y(0) = 0 and y(l)=0. Here we have 
F = y + X^l + (y') 2 , so Euler's equation is 


dx 


Xy' 


V 1 + (y ') 2 


-1 = 0 , 


(18) 


or, after carrying out the differentiation. 


y" _ 1 
U + (J t'ff 2 a.' 


(19) 


In this case no integration is necessary, since (19) tells us at once that the cur¬ 
vature is constant and equals 1/X. It follows that the required maximizing 
curve is an arc of a circle (as might have been expected) with radius X. As an 
alternate procedure, we can integrate (18) to get 

V’ _ x —C\ 

Vi+(yf * 

On solving this for y' and integrating again, we obtain 

(x - cf 2 + (y - c 2 ) 2 = X 2 , (20) 


which of course is the equation of a circle with radius X. 
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Example 2. In Example 1 it is clearly necessary to have L > 1. Also, if 
L> jt/2 the circular arc determined by (20) will not define y > 0 as a sin¬ 
gle-valued function of x. We can avoid these artificial issues by consid¬ 
ering curves in parametric form x = x(t) and y = y(f) and by turning our 
attention to the original isoperimetric problem of maximizing 

1 ^ 

-j(xy-iyi)dt 

h 


(where x = dx/dt and y = dy/dt) with the side condition 

t2 

J, Jx 2 + y 2 dt - 


- L . 


Here we have 


F = ^( x il + yx) + xjx 2 +y 2 , 


so the Euler equations (17) are 


1 Xx 

—V —< = 

2 


1 . 

—y = 0 

2 J 


and 


1 Xy 

—x + —j=^= 

2 ^ 7,2 


x + y 


+ —i = 0. 
2 


These equations can be integrated directly, which yields 

Xx , Xy 

-y + —= -Ci and x + ,—' -= c 2 . 

\ x '-y : 


If we solve for x - c 2 and y - c v square, and add, then the result is 
{x - cff + (y - cf 2 = X 2 , 

so the maximizing curve is a circle. This result can be expressed in the 
following way: if L is the length of a closed plane curve that encloses an 
area A, then A < L 2 / 4k, with equality if and only if the curve is a circle, A 
relation of this kind is called an isoperimetric inequality. 9 


9 Students of physics may be interested in the ideas discussed in G. Polya and G. Szego, 
Isoperimetric Inequalities in Mathematical Physics, Princeton University Press Princeton N.J., 
1951. 
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Finite side conditions. At the beginning of this section we formulated the 
problem of finding geodesics on a given surface 

G(x, y, z) = 0. (21) 

We now consider the slightly more general problem of finding a space curve 
x = x(t), y = y(t), z = z(t) that gives a stationary value to an integral of the form 

h 

jf(x,y,z)dt, (22) 

h 


where the curve is required to lie on the surface (21). 

Our strategy is to eliminate the side condition (21), and to do this we pro¬ 
ceed as follows. There is no loss of generality in assuming that the curve lies 
on a part of the surface where G 2 * 0. On this part of the surface we can solve 
(21) for z, which gives z = g(x, y) and 


. dg . dg . 
z = — x + — y. 
dx dy 


(23) 


When (23) is inserted in (22), our problem is reduced to that of finding uncon¬ 
strained stationary functions for the integral 


1 / 


. . dg . dg . 
x,xi,—x + — y 
J dx dy J 


dt. 


We know from the previous section that the Euler equations 67-(18) for this 
problem are 


d f df + df dgdf dz 
dt y dx dz dx ) dz dx 


and 


d_(% + dfdg" 
dtydy dz dy J 


df dz _ 
dz dy 


It follows from (23) that 


dz 

4 

'g\ 

and i 

_ d 

( a y 

dx 

dt 

K dx) 

% 

dt 

{ fy) 
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so the Euler equations can be written in the form 


df v fir ) dx dt\dz) 


and 


d_ 

dt 




tlfil 

dy dt v fiz ) 


= 0 . 


If we now define a function /.(f) by 


d_ 

dt 



= W)G Z , 


(24) 


and use the relations dg/dx = -G x /G z and dg/dy = -G XJ /G Z , then Euler's equa¬ 
tions become 


and 


d_( df_ 
dt\dx 


X(t)G x , 


(25) 


d_(djg 

dt{dy y 


X(t)G y . 


(26) 


Thus a necessary condition for a stationary value is the existence of a func¬ 
tion X(t) satisfying equations (24), (25), and (26). On eliminating /.(f), we obtain 
the symmetric equations 

(d/dt)(df/dx) _ ( d/dt)(df/8y) _ (d/dt)(df/dz) 

G x Gy G z y 

which together with (21) determine the extremals of the problem. It is worth 
remarking that equations (24), (25), and (26) can be regarded as the Euler 
equations for the problem of finding unconstrained stationary functions for 
the integral 


h 

| [f(x, ij, z) + X(t )G(x, y, z)]dt. 

h 


This is very similar to our conclusion for integral side conditions, except that 
here the multiplier is an undetermined function of f instead of an undeter¬ 
mined constant. 
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When we specialize this result to the problem of finding geodesics on the 
surface (21), we have 


f = yjx 2 + y 2 + z 2 . 


The equations (27) become 

(d/dt)(x/f) _ (d/dt)(y/f) _ (d/dt)(z/f) 

G x G y G 3 

and the problem is to extract information from this system. 


(28) 


Example 3. If we choose the surface (21) to be the sphere x 2 + y 2 + z 2 = a 2 
then G(x,y,z) - x z + y 2 + z 2 - a 2 and (28) is 

fi-xf = fij-yf = fi-zf 
2xf- 2 yf- 2 zf- ' 

which can be rewritten in the form 

xy-yx = f_ = yz-zy 
xy-yx f yi-zy 

If we ignore the middle term, this is 

(d/dt)(xy - yx) (d/dt)(yz-zy) 
xij -yx yz- zy 


One integration gives xy -yx = cfyz - zy) or 

x + Ciz _ y 
x + ccz y 

and a second yields x + c 2 z = c 2 y. This is the equation of a plane through 
the origin, so the geodesics on a sphere are arcs of great circles. A differ¬ 
ent method of arriving at this conclusion is given in Problem 67-5. 

In this example we were able to solve equations (28) quite easily, but in gen¬ 
eral this task is extremely difficult. The main significance of these equations 
lies in their connection with the following very important result in math¬ 
ematical physics: if a particle glides along a surface, free from the action of 
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any external force, then its path is a geodesic. We shall prove this dynamical 
theorem in Appendix B. For the purpose of this argument it will be conve¬ 
nient to assume that the parameter t is the arc length s measured along the 
curve, so that f- 1 and equations (28) become 


d 2 x/ds 2 d 2 y/ds 2 
G.V Gy 



(29) 


Problems 


1. Convince yourself of the validity of formula (1) for a closed convex curve 
like that shown in Figure 102. Hint: What is the geometric meaning of 


Q P 

j* y dx + j* y dx, 

p Q 


where the first integral is taken from right to left along the upper part 
of the curve and the second from left to right along the lower part? 

2. Verify formula (1) for the circle whose parametric equations are 
x = a cos f and y - a sin f, 0 < f < 2 ti. 

3. Solve the following problems by the method of Lagrange multipliers, 
(a) Find the point on the plane ax + by + cz = d that is nearest the origin. 

Hint: Minimize w = x * 1 2 3 + y 2 + z 2 with the side condition ax + by + 
cz - d = 0. 


y 


Q 



p 


X 


FIGURE 102 
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(b) Show that the triangle with greatest area A for a given 
perimeter is equilateral. Hint: If x, y, and z are the sides, then 

A = ^s(s-x)(s-y)(s-z) where s = (x + y + z)/2. 

(c) If the sum of n positive numbers x y x 2 ,-, x n has a fixed value s, prove 
that their product XjX 2 — x n has s"/n" as its maximum value, and con¬ 
clude from this that the geometric mean of n positive numbers can 
never exceed their arithmetic mean: 


%JxiX 2 ---X n < 


X\ + X2 + • • * + X n 

n 


4. A curve in the first quadrant joins (0,0) and (1,0) and has a given area 
beneath it. Show that the shortest such curve is an arc of a circle. 

5. A uniform flexible chain of given length hangs between two points. Find 
its shape if it hangs in such a way as to minimize its potential energy. 

6. Solve the original isoperimetric problem (Example 2) by using polar 
coordinates. Hint: Choose the origin to be any point on the curve and 
the polar axis to be the tangent line at that point; then maximize 

||r 2 d 0 

0 


with the side condition that 



must be constant. 

7. Show that the geodesics on any cylinder of the form g(x,z) = 0 make a 
constant angle with the y-axis. 


Appendix A. Lagrange 

Joseph Louis Lagrange (1736-1813) detested geometry but made outstanding 
discoveries in the calculus of variations and analytical mechanics. He also 
contributed to number theory and algebra, and fed the stream of thought 
that later nourished Gauss and Abel. His mathematical career can be viewed 
as a natural extension of the work of his older and greater contemporary, 
Euler, which in many respects he carried forward and refined. 

Lagrange was born in Turin of mixed French-Italian ancestry. As a boy, 
his tastes were more classical than scientific; but his interest in mathematics 
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was kindled while he was still in school by reading a paper by Edmund 
Halley on the uses of algebra in optics. He then began a course of indepen¬ 
dent study, and progressed so rapidly that at the age of nineteen he was 
appointed professor of mathematics at the Royal Artillery School in Turin. 10 

Lagrange's contributions to the calculus of variations were among his 
earliest and most important works. In 1755 he communicated to Euler his 
method of multipliers for solving isoperimetric problems. These problems 
had baffled Euler for years, since they lay beyond the reach of his own semi- 
geometrical techniques. Euler was immediately able to answer many ques¬ 
tions he had long contemplated; but he replied to Lagrange with admirable 
kindness and generosity, and withheld his own work from publication "so 
as not to deprive you of any part of the glory which is your due." Lagrange 
continued working for a number of years on his analytic version of the cal¬ 
culus of variations, and both he and Euler applied it to many new types of 
problems, especially in mechanics. 

In 1766, when Euler left Berlin for St. Petersburg, he suggested to Frederick 
the Great that Lagrange be invited to take his place. Lagrange accepted and 
lived in Berlin for 20 years until Frederick's death in 1786. During this period 
he worked extensively in algebra and number theory and wrote his master¬ 
piece, the treatise Mecanique Analytique (1788), in which he unified general 
mechanics and made of it, as Hamilton later said, "a kind of scientific poem." 
Among the enduring legacies of this work are Lagrange's equations of 
motion, generalized coordinates, and the concept of potential energy (which 
are all discussed in Appendix B). * 11 

Men of science found the atmosphere of the Prussian court rather uncon¬ 
genial after the death of Frederick, so Lagrange accepted an invitation from 
Louis XVI to move to Paris, where he was given apartments in the Louvre. 
Lagrange was extremely modest and undogmatic for a man of his great 
gifts; and though he was a friend of aristocrats—and indeed an aristocrat 
himself—he was respected and held in affection by all parties throughout 
the turmoil of the French Revolution. His most important work during these 
years was his leading part in establishing the metric system of weights and 
measures. In mathematics, he tried to provide a satisfactory foundation for 
the basic processes of analysis, but these efforts were largely abortive. Toward 
the end of his life, Lagrange felt that mathematics had reached a dead end, 
and that chemistry, physics, biology, and other sciences would attract the 
ablest minds of the future. His pessimism might have been relieved if he had 
been able to forsee the coming of Gauss and his successors, who made the 
nineteenth century the richest in the long history of mathematics. 


10 See George Sarton's valuable essay, "Lagrange's Personality," Proc. Am. Phil. Soc., vol, 88, 
pp. 457-496 (1944). 

11 For some interesting views on Lagrangian mechanics (and many other subjects), see 
S. Bochner, The Role of Mathematics in the Rise of Science, pp. 199-207, Princeton University 
Press, Princeton, N.J., 1966. 
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Appendix B. Hamilton's Principle and Its Implications 

One purpose of the mathematicians of the eighteenth century was to discover 
a general principle from which Newtonian mechanics could be deduced. 
In searching for clues, they noted a number of curious facts in elementary 
physics: for example, that a ray of light follows the quickest path through an 
optical medium; that the equilibrium shape of a hanging chain minimizes 
its potential energy; and that soap bubbles assume a shape having the least 
surface area for a given volume. These facts and others suggested to Euler 
that nature pursues its diverse ends by the most efficient and economical 
means, and that hidden simplicities underlie the apparent chaos of phenom¬ 
ena. It was this metaphysical idea that led him to create the calculus of varia¬ 
tions as a tool for investigating such questions. Euler's dream was realized 
almost a century later by Hamilton. 


Hamilton's principle. Consider a particle of mass m moving through space 
under the influence of a force 


F = Fji + F 2 j + F 3 k, 


and assume that this force is conservative in the sense that the work it does in 
moving the particle from one point to another is independent of the path. It is 
easy to show that there exists a scalar function U(x, y, z) such that dU/dx = F v 
dU/dy = F 2 , and dU/dz = F 3 . 12 The function V = - U is called the potential 
energy of the particle, since the change in its value from one point to another 
is the work done against F in moving the particle from the first point to the 
second. Furthermore, if r(t) = x(t)i + y(t) j + z(f)k is the position vector of the 
particle, so that 


dx . dy . dz , , 

v = —1 + —1 + — k and 

dt dt dt 



are its velocity and speed, respectively, then T = mv 2 / 2 is its kinetic energy. 

If the particle is at points P 1 and P 2 at times t 1 and f 2 , then we are interested 
in the path it traverses in moving from P, to P 2 . The action (or Hamilton's inte¬ 
gral) is defined as 

*2 

A = j(T-V)dt, 

ti 


and in general its value depends on the path along which the particle moves 
in passing from P 1 to P 2 . We will show that the actual path of the particle is 
one that yields a stationary value for the action A. 


12 In the language of vector analysis, F is the gradient of U. 
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The function L = T - V is called the Lagrangian, and in the case under con¬ 
sideration it is given by 


L = 



-V(x,y,z). 


The integrand of the action is therefore a function of the to r m / (x, y, z, dx/dt, 
dy/dt, dz/dt), and if the action has a stationary value, then Euler's equations 
must be satisfied. These equations are 


d 2 x dV 
m— Y +— = 0, 
dt 2 dx 


m 


d 2 y dV n d 2 z dV n 
—, + — = 0, m —, + — = 0, 


dt~ dy 


dt dz 


and can be written in the form 


m 


dt 2 


dV . 

— i 

dx 


5V ,_0V 
dy dz 


k = F. 


This is precisely Newton's second law of motion. Thus Newton's law is a 
necessary condition for the action of the particle to have a stationary value. 
Since Newton's law governs the motion of the particle, we have the following 
conclusion. 

Hamilton's principle. If a particle moves from a point P l to a point P 2 in a 
time interval t 2 < t < t 2 , then the actual path it follows is one for which the action 
assumes a stationary value. 

It is quite easy to give simple examples in which the actual path of a par¬ 
ticle maximizes the action. However, if the time interval is sufficiently short, 
then it can be shown that the action is necessarily a minimum. In this form, 
Hamilton's principle is sometimes called the principle of least action, and can 
be loosely interpreted as saying that nature tends to equalize the kinetic and 
potential energies throughout the motion. 

In the above discussion we assumed Newton's law and deduced Hamilton's 
principle as a consequence. The same argument shows that Newton's law fol¬ 
lows from Hamilton's principle, so these two approaches to the dynamics of 
a particle—the vectorial and the variational—are equivalent to one another. 
This result emphasizes the essential characteristic of variational principles 
in physics: they express the pertinent physical laws in terms of energy alone, 
without reference to any coordinate system. 

The argument we have given extends at once to a system of n particles of 
masses m, with position vectors r,(f) = x,(f)i + y ; (f)j + z,(f)k, which are moving 
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under the influence of conservative forces F, = F n i + F a j + F i3 k. Here the 
potential energy of the system is a function V(x v y v z v ..., x n , y n , z„) such that 


<9V 

dx, 


= ~F n , 


SV 

dy, 


= -F t2 , 


dV_ 

dzj 


= -f, 3 


the kinetic energy is 


T = 




+ 



and the action over a time interval t 1 <t <t 2 is 

t2 

A = j(T-V)dt. 

h 


In just the same way as above, we see that Newton's equations of motion for 
the system. 



are a necessary condition for the action to have a stationary value. Hamilton's 
principle therefore holds for any finite system of particles in which the forces 
are conservative. It applies equally well to more general dynamical systems 
involving constraints and rigid bodies, and also to continuous media. 

In addition, Hamilton's principle can be made to yield the basic laws of 
electricity and magnetism, quantum theory, and relativity. Its influence is so 
profound and far-reaching that many scientists regard it as the most power¬ 
ful single principle in mathematical physics and place it at the pinnacle of 
physical science. Max Planck, the founder of quantum theory, expressed this 
view as follows: "The highest and most coveted aim of physical science is to 
condense all natural phenomena which have been observed and are still to 
be observed into one simple principle.... Amid the more or less general laws 
which mark the achievements of physical science during the course of the 
last centuries, the principle of least action is perhaps that which, as regards 
form and content, may claim to come nearest to this ideal final aim of theo¬ 
retical research." 


Example 1. If a particle of mass m is constrained to move on a given sur¬ 
face G(x, y, z) = 0, and if no force acts on it, then it glides along a geodesic. 
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To establish this, we begin by observing that since no force is present we 
have V = 0, so the Lagrangian L = T - V reduces to T where 



We now apply Hamilton's principle, and require that the action 

tl £2 

J L At = J T dt 

h ti 


be stationary subject to the side condition G(x, y, z ) = 0. By Section 68, this 
is equivalent to requiring that the integral 

(i 

J" [T + X(t)G(x,y,z)]dt 

fi 

be stationary with no side condition, where X(t) is an undetermined 
function of t. Euler's equations for this unconstrained variational prob¬ 
lem are 


m^--XG x = 0, m —j- - XG y = 0, m^-XG-= 0. 

dt 2 dt 2 y dt 2 

When m and X are eliminated, these equations become 

d 2 x/dt 2 d 2 y/dt 2 d 2 z/dt 

G x Gy G z 

Now the total energy T + V = T of the particle is constant (we prove this 
below), so its speed is also constant, and therefore s = kt for some con¬ 
stant k if the arc length s is measured from a suitable point. This enables 
us to write our equations in the form 

d 2 x/ds 1 d 2 y/ds 2 d 2 z/ds 2 

G x Gy G z 

These are precisely equations 68-(29), so the path of the particle is a geo¬ 
desic on the surface, as stated. 

Lagrange's equations. In classical mechanics, Hamilton's principle can be 
viewed as the source of Lagrange's equations of motion, which occupy a 
dominant position in this subject. In order to trace the connection, we must 
first understand what is meant by degrees of freedom and generalized 
coordinates. 
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A single particle moving freely in three-dimensional space is said to have 
three degrees of freedom, since its position can be specified by three inde¬ 
pendent coordinates x, y, and z. By constraining it to move on a surface 
G(x,y,z)- 0, we reduce its degrees of freedom to two, since one of its coordi¬ 
nates can be expressed in terms of the other two. Similarly, an unconstrained 
system of n particles has 3 n degrees of freedom, and the effect of introducing 
constraints is to reduce the number of independent coordinates needed to 
describe the configurations of the system. If the rectangular coordinates of 
the particles are x„ y. and z, ( i - 1, 2,..., n), and if the constraints are described 
by k consistent and independent equations of the form 

G j (x 1/ y 1 ,z 1 ,...,x n ,y n ,z n ) = 0, j = l,2,...,k, 

then the number of degrees of freedom is m = 3 n - k. In principle these equa¬ 
tions canbeusedto reduce the number of coordinates from 3ntom by express¬ 
ing the 3 n numbers x ; , y, and z, (i = 1, 2 ,..., n) in terms of m of these numbers. 
It is more convenient, however, to introduce Lagrange's generalized coordinates 
q y q 2 ,..., q m , which are any m independent coordinates whatever whose values 
determine the configurations of the system. This allows us full freedom to 
choose any coordinate system adapted to the problem at hand—rectangular, 
cylindrical, spherical, or any other—and renders our analysis independent 
of any particular coordinate system. We now express the rectangular coordi¬ 
nates of the particles in terms of these generalized coordinates and note that 
the resulting formulas automatically include the constraints: x, = x, (q y ..., q m ), 
Vi = Vkdv- qj, and z { = zfq y ... qj, where i = 1, 2 ,..., n. 

If m, is the mass of the zth particle, then the kinetic energy of the system is 



and in terms of the generalized coordinates this can be written as 


T = 





( 1 ) 


where f = dqfdt. For later use, we point out that T is a homogeneous function 
of degree 2 in the cfo The potential energy V of the system is assumed to be a 
function of the q t alone, so the Lagrangian L = T - V is a function of the form 


k q2, ..., qm/ qi/ qz/ • • • / qm)‘ 
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Hamilton's principle tells us that the motion proceeds in such a way that the 

action L dt is stationary over any interval of time t 1 <t< t 2 , so Euler's equa- 
Jfl 

tions must be satisfied. In this case these are 


d_ 

dt 




df 





( 2 ) 


which are called Lagrange's equations. They constitute a system of m second 
order differential equations whose solution yields the q f as functions of t. 

We shall draw only one general deduction from Lagrange's equations, 
namely, the law of conservation of energy. 

The first step in the reasoning is to note the following identity, which holds 
for any function L of the variables t,q 1 ,q 2 ,...,q m ,q 1 ,(] 2 ,...,(]m'- 


d_ 

dt 


% 

i =i 


5L 

df 


-L 


Jj’ 


d 

i dL ) 

dt 



5L 

dqj 


dL 

dt 


(3) 


Since the Lagrangian L of our system satisfies equations (2) and does not 
explicitly depend on f, the right side of (3) vanishes and we have 





£ 


(4) 


for some constant £. We next observe that dV/df = 0, so dL/df = dT/df. As 
we have already remarked, formula (1) shows that T is a homogeneous func¬ 
tion of degree 2 in the f so 




8L 

df 


8T 




= 2 T 


by Euler's theorem on homogeneous functions. 13 With this result, equation 
(4) becomes 2T - L = £ or 2T - (T - V) = £, so 

T+V=E, 


13 Recall that a function f(x, y) is homogeneous of degree n in x and y if f(kx, ky) = k’f(x, y). 
If both sides of this are differentiated with respect to k and then k is set equal to 1, we obtain 

df , 

x -t + y-t =n f( x ’yf 

ox oy 

which is Euler's theorem for this function. The same result holds for a homogeneous func¬ 
tion of more than two variables. 
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which states that during the motion, the sum of the kinetic and potential 
energies is constant. 

In the following example we illustrate the way in which Lagrange's equa¬ 
tions can be used in specific dynamical problems. 


Example 2. If a particle of mass m moves in a plane under the influ¬ 
ence of a gravitational force of magnitude km/r 2 directed toward the 
origin, then it is natural to choose polar coordinates as the generalized 

coordinates: = r and q 2 = 0. It is easy to see that T = (m/2)(r 2 + r 2 0 2 ) and 
V = -km/r, so the Lagrangian is 


L = 


T-V = —(f 2 +r 2 0 2 ) + 
2 


km 

r 


and Lagrange's equations are 


d_fdL) 
dt i, dr j 


dL 

dr 


= 0 , 


d_fdL) 

dt UeJ 


SL 

50 


= 0 . 


(5) 

( 6 ) 


Since L does not depend explicitly on 0, equation (6) shows that 
dL /50 = mr 2 Q is constant, so 


2 dQ 
r — 
dt 


= h 


(7) 


for some constant h assumed to be positive. We next observe that (5) can 
easily be written in the form 


d 2 r L d0h 2 _ k 
dt 2 /dt J r 2 


This is precisely equation 21-(12), which we solved in Section 21 to obtain 
the conclusion that the path of the particle is a conic section. 

Variational problems for double integrals. Our general method of finding 
necessary conditions for an integral to be stationary can be applied equally 
well to multiple integrals. For example, consider a region R in the xy-plane 
bounded by a closed curve C (Figure 103). Let z = z(x, y) be a function that is 
defined in R and assumes prescribed boundary values on C, but is otherwise 
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FIGURE 103 


arbitrary (except for the usual differentiability conditions). This function can 
be thought of as defining a variable surface fixed along its boundary in space. 
An integral of the form 


f(z) = ||/(x,y , z,z I ,z y )dx dy (8) 

R 

will have values that depend on the choice of z, and we can pose the problem 
of finding a function z (a stationary function) that gives a stationary value to 
this integral. 

Our reasoning follows a familiar pattern. Assume that z(x, y) is the desired 
stationary function and form the varied function z(x,y) = z(x,y) + ar\(x,y), 
where r|(x, y) vanishes on C. When z is substituted into the integral (8), we 
obtain a function 1(a) of the parameter a, and just as before, the necessary 
condition J'(0) = 0 yields 






hy 


dx dy = 0. 


( 9 ) 


To simplify the task of eliminating rp and r| y/ we now assume that the curve 
C has the property that each line in the xty-plane parallel to an axis intersects 
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C in at most two points. 14 Then, regarding the double integral of the second 
term in parentheses in (9) as a repeated integral (see Figure 103), we get 


d *2(y) 


life T lx^dy = J J J^r \ x dxdy; 


c x\(y) 


and since 


J dz,- dZy „ J 1 0z, j 


x 2 

f 5 

f 9f ] 

J dx 

l dz x ) 


dx 


because q vanishes on C, it follows that 


ft d 

f ^ ] 

JJ 11 dx 

ydZ X ; 


dx dy. 


The term containing q can be transformed by a similar procedure, and (9) 
becomes 


ffn 

df d 

f df ^ 

d 

{ 9f 11 

JJ’ 1 

R 

dz dx 

ydZ Xy 

dy 



dx dy = 0. 


( 10 ) 


We now conclude from the arbitrary nature of q that the bracketed expres¬ 
sion in (10) must vanish, so 


d 

dx 


f 1 

d 

-\ - 

[ df) 

\dz x ) 

dy 

[ dz J 


-£ = 0 

dz 


( 11 ) 


is Euler's equation for an extremal in this case. As before, a stationary func¬ 
tion (if one exists) is an extremal that satisfies the given boundary conditions. 


Example 3. In its simplest form, the problem of minimal surfaces was 
first proposed by Euler as follows: to find the surface of smallest area 
bounded by a given closed curve in space. If we assume that this curve 


14 This restriction is unnecessary, and can be avoided it we are willing to use Green's theorem. 
























The Calculus of Variations 


617 


projects down to a closed curve C surrounding a region R in the xy- 
plane, and also that the surface is expressible in the form z = z{x, y), then 
the problem is to minimize the surface area integral 



R 


subject to the boundary condition that z(x, y) must assume prescribed 
values on C. Euler's equation (11) for this integral is 



( \ ( \ 


\ 


which can be written in the form 


Zxx (1 + Zy ) - 2z x z y z xy + z w (1 + zj) = 0 . 


( 12 ) 


This partial differential equation was discovered by Lagrange. Euler showed 
that every minimal surface not part of a plane must be saddle-shaped, and 
also that its mean curvature must be zero at every point. 15 The mathematical 
problem of proving that minimal surfaces exist, i.e., that (12) has a solution 
satisfying suitable boundary conditions, is extremely difficult. A complete 
solution was attained only in 1930 and 1931 by the independent work of 
T. Rado (Hungarian, 1895-1965) and J. Douglas (American, 1897-1965). An 
experimental method of finding minimal surfaces was devised by the blind 
Belgian physicist J. Plateau (1801-1883), who described it in his 1873 treatise 
on molecular forces in liquids. The essence of the matter is that if a piece 
of wire is bent into a closed curve and dipped in a soap solution, then the 
resulting soap film spanning the wire will assume the shape of a minimal 
surface in order to minimize the potential energy due to surface tension. 
Plateau performed many striking experiments of this kind, and since his 
time the problem of minimal surfaces has been known as Plateau's problem. 16 

Example 4. In Section 40 we obtained the one-dimensional wave equa¬ 
tion from Newton's second law of motion. In this example we deduce 
it from Hamilton's principle with the aid of equation (11). Assume the 
following: a string of constant linear mass density m is stretched with 
a tension T and fastened to the x-axis at the points x = 0 and x = n; it is 


15 The mean curvature of a surface at a point is defined as follows. Consider the normal line 
to the surface at the point, and a plane containing this normal line. As this plane rotates 
about the line, the curvature of the curve in which it intersects the surface varies, and the 
mean curvature is one-half the sum of its maximum and minimum values. 

16 The standard mathematical work on this subject is R. Courant, Dirichlet's Principle, Conformal 
Mapping, and Minimal Surfaces, Interscience-Wiley, New York, 1950. 
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plucked and allowed to vibrate in the xy-plane; and its displacements 
y(x,t ) are relatively small, so that the tension remains essentially con¬ 
stant and powers of the slope higher than the second can be neglected. 
When the string is displaced, an element of length dx is stretched to a 
length ds, where 


ds 


' \jl 1 /v dx 




This approximation results from expanding -yjl + y x =(l + iin the 
binomial series 1 + y\ jl + • • • and discarding all powers of y x higher than 


the second. The work done on the element is T(ds-dx) 
potential energy of the whole string is 


1 

2 


Tyldx, so the 


V = \ T \ V ' dx ' 

0 


The element has mass m dx and velocity y t , so its kinetic energy is — my]dx, 
and for the whole string we have 


n 

?7jJ" yfdx. 


The Lagrangian is therefore 


TC 

L = T - V = | J(mi / f 2 - Ty 2 x )dx, 

0 

and the action, which must be stationary by Hamilton's principle, is 

t2 n 

~ T yi) dx dt 

h o 


In this case equation (11) becomes 

T 

Px % - ytt, 
m 

which we recognize as the wave equation 40-(8). 

NOTE ON HAMILTON. The Irish mathematician and mathematical 
physicist William Rowan Hamilton (1805-1865) was a classic child prodigy. 
He was educated by an eccentric but learned clerical uncle. At the age of 
three he could read English; at four he began Greek, Latin, and Hebrew; at 
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eight he added Italian and French; at ten he learned Sanskrit and Arabic; 
and at thirteen he is said to have mastered one language for each year he 
had lived. This forced flowering of linguistic futility was broken off at the 
age of fourteen, when he turned to mathematics, astronomy, and optics. At 
eighteen he published a paper correcting a mistake in Laplace's Mecanique 
Celeste ; and while still an undergraduate at Trinity College in Dublin, he 
was appointed professor of astronomy at that institution and automatically 
became Astronomer Royal of Ireland. 

His first important work was in geometrical optics. He became famous at 
twenty-seven as a result of his mathematical prediction of conical refraction. 
Even more significant was his demonstration that all optical problems can be 
solved by a single method that includes Fermat's principle of least time as a 
special case. He then extended this method to problems in mechanics, and 
by the age of thirty had arrived at a single principle (now called Hamilton's 
principle) that exhibits optics and mechanics as merely two aspects of the 
calculus of variations. 

In 1835 he turned his attention to algebra, and constructed a rigorous 
theory of complex numbers based on the idea that a complex number is an 
ordered pair of real numbers. This work was done independently of Gauss, 
who had already published the same ideas in 1831, but with emphasis on the 
interpretation of complex numbers as points in the complex plane. Hamilton 
subsequently tried to extend the algebraic structure of the complex numbers, 
which can be thought of as vectors in a plane, to vectors in three-dimen¬ 
sional space. This project failed, but in 1843 his efforts led him to the dis¬ 
covery of quaternions. These are four-dimensional vectors that include the 
complex numbers as a subsystem; in modern terminology, they constitute 
the simplest noncommutative linear algebra in which division is possible. 17 
The remainder of Hamilton's life was devoted to the detailed elaboration of 
the theory and applications of quaternions, and to the production of mas¬ 
sive indigestible treatises on the subject. This work had little effect on phys¬ 
ics and geometry, and was supplanted by the more practical vector analysis 
of Willard Gibbs and the multilinear algebra of Grassmann and E. Cartan. 
The significant residue of Hamilton's labors on quaternions was the demon¬ 
strated existence of a consistent number system in which the commutative 
law of multiplication does not hold. This liberated algebra from some of the 
preconceptions that had paralyzed it, and encouraged other mathematicians 
of the late nineteenth and twentieth centuries to undertake broad investiga¬ 
tions of linear algebras of all types. 

Hamilton was also a bad poet and friend of Wordsworth and Coleridge, 
with whom he corresponded voluminously on science, literature, and 
philosophy. 


17 Fortunately Hamilton never learned that Gauss had discovered quaternions in 1819 but kept 
his ideas to himself. See Gauss, Werke, vol. VIII, pp. 357-362. 
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69 The Method of Successive Approximations 

One of the main recurring themes of this book has been the idea that only a 
few simple types of differential equations can be solved explicitly in terms 
of known elementary functions. Some of these types are described in the 
first three chapters, and Chapter 5 provides a detailed account of second 
order linear equations whose solutions are expressible in terms of power 
series. However, many differential equations fall outside these categories, 
and nothing we have done so far suggests a procedure that might work in 
such cases. 

We begin by examining the initial value problem described in Section 2: 

y'=f(x,y), y(x 0 )=y 0 , (1) 

where/(x, y) is an arbitrary function defined and continuous in some neigh¬ 
borhood of the point (x 0 , i/„). In geometric language, our purpose is to devise 
a method for constructing a function y=y(x) whose graph passes through 
the point (x 0 , y 0 ) and that satisfies the differential equation y' =/(x, y) in some 
neighborhood of x 0 (Figure 104). We are prepared for the idea that elemen¬ 
tary procedures will not work and that in general some type of infinite pro¬ 
cess will be required. 

The method we describe furnishes a line of attack for solving differential 
equations that is quite different from any the reader has encountered before. 
The key to this method lies in replacing the initial value problem (1) by the 
equivalent integral equation 


y(x) = y 0 + j f[t,y(t)]dt. (2) 

*0 
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FIGURE 104 


This is called an integral equation because the unknown function occurs 
under the integral sign. To see that (1) and (2) are indeed equivalent, suppose 
that y(x) is a solution of (1). Then y(x) is automatically continuous and the 
right side of 


y'(x)=f[x, y(x)\ 

is a continuous function of x; and when we integrate this from x 0 to x and 
use y(x 0 ) = y 0 , the result is (2). As usual, the dummy variable t is used in (2) 
to avoid confusion with the variable upper limit x on the integral. Thus any 
solution of (1) is a continuous solution of (2). Conversely, if y(x) is a continuous 
solution of (2), then y(x 0 )=y 0 because the integral vanishes when x = x 0 , and 
by differentiation of (2) we recover the differential equation y'(x)=/ [x, y(x)]. 
These simple arguments show that (1) and (2) are equivalent in the sense that 
the solutions of (1)—if any exist—are precisely the continuous solutions of 
(2). In particular, we automatically obtain a solution for (1) if we can construct 
a continuous solution for (2). 

We now turn our attention to the problem of solving (2) by a process of 
iteration. That is, we begin with a crude approximation to a solution and 
improve it step by step by applying a repeatable operation which we hope 
will bring us as close as we please to an exact solution. The primary advan¬ 
tage that (2) has over (1) is that the integral equation provides a convenient 
mechanism for carrying out this process, as we now see. 

A rough approximation to a solution is given by the constant function 
y 0 (x) =y 0 , which is simply a horizontal straight line through the point (x 0 , y 0 ). 





The Existence and Uniqueness of Solutions 


623 


We insert this approximation in the right side of equation (2) in order to 
obtain a new and perhaps better approximation yfx) as follows: 

ij 1 (x) = ij 0 + j f(t,ij 0 )dt. 

*0 


The next step is to use yfx) to generate another and perhaps even better 
approximation y 2 (x) in the same way: 


yi(x) = yo + J/[f,yi(f)]df. 
*0 


At the nth stage of the process we have 


y n (x) = y 0 + jf[t,y„-i(t)]dt. (3) 

*0 


This procedure is called Picard's method of successive approximations. 1 We show 
how it works by means of a few examples. 

The simple initial value problem 


y'=y, y(0) = i 

has the obvious solution y(x) = e x . The equivalent integral equation is 

X 

y(x) = l+j y(t)dt, 

o 


1 Emile Picard (1856-1941), one of the most eminent French mathematicians of the past century, 
made two outstanding contributions to analysis: his method of successive approximations, 
which enabled him to perfect the theory of differential equations that Cauchy had initiated in 
the 1820s; and his famous theorem (called Picard's Great Theorem) about the values assumed 
by a complex analytic function near an essential singularity, which has stimulated much 
important research down to the present day. Like a true Frenchman, he was a connoisseur of 
fine food and was particularly fond of bouillabiasse. 
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and (3) becomes 


X 

y„(x) = l +jy„_i (t)dt. 
0 


With y 0 (x) = 1, it is easy to see that 

X 

y 1 (x) = l + Jrff = l + x, 
o 

x 

y 2 (x) = l +j*(l + f) dt 

o 

x ( 

y 3 (*) = l + Jl + i + t 


= l + x + - 


2 > 

2 , 


X 2 r 3 

dt = \ + x + — + —— 
2 2-3' 


and in general 


/ \ - XX X 

l/„(x) = 1 + X + — + — + ••• + ■—. 

7 2! 3! n\ 

In this case it is very clear that the successive approximations do in fact con¬ 
verge to the exact solution, for these approximations are the partial sums of 
the power series expansion of e x . 

Let us now consider the problem 

y'=x+y, y(0) = l. (4) 

This is a first order linear equation, and the solution satisfying the given 
initial condition is easily found to be y{x) = 2e x -x-1. The equivalent integral 
equation is 


y(x) = 1 + J* [f + y{t)\ dt , 
0 


y n (x) = 1+J*[f + y„-i(t)]dt. 
0 


and (3) is 
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With y 0 (x) - 1, Picard's method yields 


y x (x) = 1 + J*(f + 1) dt = 1 

o 

X 

y 2 (x) = 1 + J 
0 
x 

y 3 (x) = 1 + J 


= t + x + - 


t ^ 

l + 2t + — 
21 


2 ! 


dt =1 + x + x 2 + —, 
3! 


4-3 \ 


1 + +1 + - 


3! 


T 2 X^ X 4 

= 1 + X + X H-+- 

3 4! 


dt 


X 

y 4 (x) = i+J 


t 3 t 4 ^ 

l + 2f+ t 2 + — + — 

3 4! 


2 x 3 x 4 x 5 


cif 


— 1 + X + X H-1-h 

3 3-4 5! 


and in general 


y„(x) = l + x + 2 


^ x 2 x 3 x" 

-1-1- • • • H- 

v 2! 3! n\ j 


^ x ,1+1 

(« + l)! 


This evidently converges to 

1 + x + 2(e x - x - 1) + 0 = 2e x - x - 1, 
so again we have the exact solution. 

In spite of these examples, the reader may not be entirely convinced of 
the practical value of Picard's method. What are we to do, for instance, if the 
successive integrations are very complicated, or not possible at all except in 
principle? This skepticism is justified, for the real power of Picard's method 
lies mainly in the theory of differential equations—not in actually find¬ 
ing solutions, but in proving under very general conditions that an initial 
value problem has a solution and that this solution is unique. Theorems 
that make precise assertions of this kind are called existence and uniqueness 
theorems. We shall state and prove several of these theorems in the next two 
sections. 
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Problems 

1. Find the exact solution of the initial value problem 



Starting with y 0 (x) = 1, apply Picard's method to calculate yfx), y 2 (x), 
i/ 3 (x), and compare these results with the exact solution. 

2. Find the exact solution of the initial value problem 


y' = 2x(l + \j), y(0) = 0. 


Starting with y 0 (x) = 0, calculate yfx), y 2 (x), t/ 3 (x), t/ 4 (x), and compare these 
results with the exact solution. 

3. It is instructive to see how Picard's method works with a choice of the 
initial approximation other than the constant function y 0 (x) = y 0 . Apply 
the method to the initial value problem (4) with 

(a) y 0 (x) = e x ; 

(b) y 0 (x) = 1+x; 

(c) yfx) = cos x. 


70 Picard's Theorem 

As we pointed out at the end of the last section, the principal value of Picard's 
method of successive approximations lies in the contribution it makes to the 
theory of differential equations. This contribution is most clearly illustrated 
in the proof of the following basic theorem. 

Theorem A. (Picard's theorem.) Letf(x, y) and df/dy he continuous functions of 
x and y on a closed rectangle R with sides parallel to the axes ( Figure 105). If (x 0 , y 0 ) 
is any interior point ofR, then there exists a number h> 0 with the property that the 
initial value problem 


y'=f(x,y), y(x 0 )-y 0 


( 1 ) 


has one and only one solution y = y(x) on the interval \x - x 0 | <h. 

Proof. The argument is fairly long and intricate, and is best absorbed in easy 
stages. 
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FIGURE 105 


First, we know that every solution of (1) is also a continuous solution of the 
integral equation 


y(x) = y 0 + j f[t,y(t)]dt, (2) 

*0 


and conversely. This enables us to conclude that (1) has a unique solution on 
an interval | x - x 0 1 < h if and only if (2) has a unique continuous solution on 
the same interval. In Section 69 we presented some evidence suggesting that 
the sequence of functions y n (x) defined by 


yo(x) = y 0 , 

X 

y 1 (x) = y 0 + jf[t,y 0 (t)\dt, 

XQ 

X 

y 2 (x) = y 0 + jf[t,yi(t)\dt, 

XQ 

X 

y n (x) = y 0 + jf[t,y„- 1 (t)]dt, 

*0 


converges to a solution of (2). We next observe that y n (x ) is the nth partial sum 
of the series of functions 
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ijo(x) + Z [J/ n(x) - y«-i{x)\ = y 0 (x) + [yfx) - y 0 {x)\ 

n =1 

+ M*) - yfx)\ + • ■ • + [ijn(x) - y n -i{x)\ + ■■■, (4) 

so the convergence of the sequence (3) is equivalent to the convergence of this 
series. In order to complete the proof, we produce a number h> 0 that defines 
the interval \x - x 0 1 <h and then we show that on this interval the following 
statements are true: (i) the series (4) converges to a function y(x); (ii) y(x) is a 
continuous solution of (2); (iii) y(x) is the only continuous solution of (2). 

The hypotheses of the theorem are used to produce the positive number h, 
as follows. We have assumed that/(x, y) and df/dy are continuous functions 
on the rectangle R. But R is closed (in the sense that it includes its boundary) 
and bounded, so each of these functions is necessarily bounded on R. This 
means that there exist constants M and K such that 

I f(x,y)\<M (5) 


and 


dy 


<K 


( 6 ) 


for all points (x, y) in R. We next observe that if (x, y,) and (x, y 2 ) are distinct 
points in R with the same x coordinate, then the mean value theorem guar¬ 
antees that 


\f(x,yf)-f(x,y 2 )\ 


d_ 

dy 


f(x,y*) 


yi-3/2 


(7) 


for some number y* between ty, and iy 2 . It is clear from (6) and (7) that 

\f(x,y 1 )-f(x,y 2 )\ <K|yi-y 2 | (8) 

for any points (x, y^ and ( x , iy 2 ) in R (distinct or not) that lie on the same verti¬ 
cal line. We now choose h to be any positive number such that 

Kh<l (9) 

and the rectangle R' defined by the inequalities |x - x 0 \ <h and |y - y 0 \ <Mh 
is contained in R. Since (x 0 , y Q ) is an interior point of R, there is no difficulty in 
seeing that such an h exists. The reasons for these apparently bizarre require¬ 
ments will of course emerge as the proof continues. 
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From this point on, we confine our attention to the interval | x - x 0 \ < h. In 
order to prove (i), it suffices to show that the series 

\y 0 (x) |+| yfx) - y 0 (x) |+| y 2 (x) - yi(x)]+ • • • +| y„(x) - y»-fx) j+ • • • (10) 

converges; and to accomplish this, we estimate the terms | y n (x) - y„_ 1 (x) |. It is 
first necessary to observe that each of the functions y n (x) has a graph that lies 
in R' and hence in R. This is obvious for y n (x) = i/ n , so the points ft, t/ n (f)l are in 
R', (5) yields |/[f,y 0 (f)]| <M, and 



which proves the statement for yfx). It follows in turn from this inequality 
that the points [f, y, (f)] are in R', so |/[f, y v (f)] | <M and 



Similarly, 



and so on. Now for the estimates mentioned above. Since a continuous func¬ 
tion on a closed interval has a maximum, and yfx) is continuous, we can 
define a constant a by a - max | yfx) - y 0 \ and write 


Next, the points [f, yft)] and [t, y 0 (f)] lie in R', so (8) yields 
I yM -fit, yM\ ^K\yft) - y 0 (t) I <Ka 


and we have 



< Kali = a(Kh). 
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Similarly, 

I f[t, y 2 (0] -f[t, yM\ zK\y 2 (t) - yft)\ <K 2 ah, 
so 


\y 3 (x)-y 2 (x)\ = 


X 

\(f[t,y 2 (t)]-f[t,ym)dt 


< ( K 2 ah)h = a(kh) 2 . 


By continuing in this manner, we find that 

|y„(*)-y„-i(*)l ^ a(Kh)"~ l 

for every n = 1, 2,... Each term of the series (10) is therefore less than or equal 
to the corresponding term of the series of constants 

\y 0 \ + a + a(Kh) + a(Kh) 2 + --- + a(Kh) n ~ 1 + •••. 

But (9) guarantees that this series converges, so (10) converges by the com¬ 
parison test, (4) converges to a sum which we denote by y(x), and y„(x) -» y(x). 
Since the graph of each y n (x) lies in R', it is evident that the graph of y(x) also 
has this property. 

Now for the proof of (ii). The above argument shows not only that y n (x) 
converges to y(x) in the interval, but also that this convergence is uniform. 
This means that by choosing n to be sufficiently large, we can make y„(x) 
as close as we please to y(x)for all x in the interval, or more precisely, if e>0 
is given, then there exists a positive integer n 0 such that if n > n 0 we have 
| y(x) - y n (x) | < e for all x in the interval. Since each y n (x) is clearly continu¬ 
ous, this uniformity of the convergence implies that the limit function y(x) 
is also continuous. 2 To prove that y(x) is actually a solution of (2), we must 
show that 


y(x)-y 0 -jf[t,y(t)\dt = 0. (11) 

*0 


2 We will not discuss this in detail, but the reasoning is quite simple and rests on the inequality 
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But we know that 


y«(x) - 3 /o - J/[h y„- 1 (£)] * = 0, (12) 

*0 


so subtracting the left side of (12) from the left side of (11) gives 


y(x) - i/o - jf[t, y(t)\dt = y{x) - y„(x) + J (f[t, y„_i(f)] - /[f, y(f)]) dt, 

XQ XQ 


and we obtain 


y(x)-y 0 - jf[t,y(t)]dt 
*0 


^\y{x)-y n (x)\ + 


x 

J(/lAy* 


f[t,y(t)])dt 


Since the graph of y(x) lies in R' and hence in R, (8) yields 


y(x)-y 0 -jf[t,iy(t)\dt 

X0 

<| y(x) - y„ (x) | +Kh max | y„_i(x) - y(x) |. (13) 

The uniformity of the convergence of y„(x) to y(x) now implies that the right 
side of (13) can be made as small as we please by taking n large enough. The 
left side of (13) must therefore equal zero, and the proof of (11) is complete. 

In order to prove (iii), we assume that y(x) is also a continuous solution 
of (2) on the interval |x - x 0 | <h, and we show that y(x) = y(x) for every x 
in the interval. For the argument we give, it is necessary to know that the 
graph of y(x) lies in R' and hence in R, so our first step is to establish this 
fact. Let us suppose that the graph of y(x) leaves R' (Figure 106). Then the 
properties of this function [continuity and the fact that y(x 0 ) = y 0 ] imply that 
there exists an x 1 such that |x x — x 0 1 <h, \ y(xi)-y 0 1= Mh and | y(x)-y 0 1< Mh 
if | x — x 0 1 < | Xj - x 0 1. It follows that 

|y(xi)-y 0 l _ Mh > Mh = M 
| x x - x 0 1 | x 1 - x 0 1 h 
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FIGURE 106 


However, by the mean value theorem there exists a number x* between x 0 
and x 1 such that 


|y(*i)-yp 

| *1 - x 0 1 


yV)l=l/[**/y(**)]|^M, 


since the point [ x*,y{x*)\ lies in R'. This contradiction shows that no point 
with the properties of x 1 can exist, so the graph of y(x) lies in R'. To complete 
the proof of (iii), we use the fact that y(x) and y(x) are both solutions of (2) to 
write 


|y(*)-y(*)l= 


| {/[f,y(f)]-/[f,y(f)]}df. 


Since the graphs of y{x) and y{x) both lie in R', (8) yields 
I y(x) - y(x) | < Kh max | y(x) - y(x) |, 


so 


max | y(x) - y(x) \ < Kh max | y(x)- y(x) |. 


This implies that max fj(x)-y(x)\ = 0, for otherwise we would have 1 <Kh 
in contradiction to (9). It follows that y(x) = y(x) for every x in the inter¬ 
val |x — x 0 1 <h, and Picard's theorem is fully proved. 


Remark 1. This theorem can be strengthened in various ways by weakening 
its hypotheses. For instance, our assumption that df/dy is continuous on R is 
stronger than the proof requires, and is used only to obtain the inequality (8). 
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We can therefore introduce this inequality into the theorem as an assump¬ 
tion that replaces the one about df/dy. In this way we arrive at a stronger form 
of the theorem since there are many functions that lack a continuous partial 
derivative but nevertheless satisfy (8) for some constant K. This inequality, 
which says that the difference quotient 

/(x,yi)-/(x,y 2 ) 

Vi-yi 

is bounded on R, is called a Lipschitz 3 condition in the variable y. 

Remark 2. If we drop the Lipschitz condition, and assume only that/(x,y) is 
continuous on R, then it is still possible to prove that the initial value problem 
(1) has a solution. This result is known as Peano's theorem , 4 The only known 
proofs depend on more sophisticated arguments than those we have used 
above. 5 Furthermore, the solution whose existence this theorem guarantees 
is not necessarily unique. As an example, consider the problem 

y' =3y 2/3 , y(0)=0, (14) 

and let R be the rectangle |x| <1, |y | <1. Here/(x,y) = 3y 2/3 is plainly continu¬ 
ous on R. Also, y-fx) =x 3 and y 2 (x) = 0 are two different solutions valid for all 
x, so (14) certainly has a solution that is not unique. The explanation for this 
nonuniqueness lies in the fact that/(x,y) does not satisfy a Lipschitz condi¬ 
tion on the rectangle R, since the difference quotient 

/(0,y)-/(0,0) 3 y 2/3 3 

y =o y y m 

is unbounded in every neighborhood of the origin. 


3 Rudolf Lipschitz (1832-1903) was a professor at Bonn for most of his life. He is remembered 
chiefly for his role in simplifying and clarifying Cauchy's original theory of the existence and 
uniqueness of solutions of differential equations. However, he also extended Dirichlet's theo¬ 
rem on the representability of a function by its Fourier series, obtained the formula for the 
number of ways a positive integer can be expressed as a sum of four squares as a consequence 
of his own theory of the factorization of integral quaternions, and made useful contributions 
to theoretical mechanics, the calculus of variations, Bessel functions, quadratic differential 
forms, and the theory of viscous fluids. 

4 Guiseppe Peano (1858-1932), Italian logician and mathematician, strongly influenced 
Hilbert's axiomatic treatment of plane geometry and the work of Whitehead and Russell on 
mathematical logic. His postulates for the positive integers have led generations of students 
to wonder whether all of modern algebra is some kind of conspiracy to render the obvious 
obscure (it is not!). In 1890 he astounded the mathematical world with his remarkable con¬ 
struction of a continuous curve in the plane that completely fills the square 0<x<l, 0<y<l. 
Unfortunately for a man who valued logic so highly, his 1886 proof of the above existence 
theorem for solutions of y' =f(x,y) was inadequate, and a satisfactory proof was not found 
until many years later. 

5 See, for example, A. N. Kolmogorov and S. V. Fomin, Elements of the Theory of Functions and 
Functional Analysis, vol, 1, p. 56, Graylock, Baltimore, 1957. 
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Remark 3. Theorem A is called a local existence and uniqueness theorem 
because it guarantees the existence of a unique solution only on some inter¬ 
val | x - x Q | <h where h may be very small. There are several important cases 
in which this restriction can be removed. Let us consider, for example, the 
first order linear equation 


y' + P{x)y=Q(x), 

where P(x) and Q(x) are defined and continuous on an interval a<x<b. Here 
we have 


f(x,y) = -P(x)y+Q(x); 

and if K = max| P(x) | for a<x<b, it is clear that 

l/foyi) ~f(x,yd\ = |-P(*)(j/i - 3/2)1 \y 1 -y 2 \. 

The function f(x,y) is therefore continuous and satisfies a Lipschitz condition 
on the infinite vertical strip defined by a<x<b and -°°<y<°°. Under these 
circumstances, the initial value problem 

y’ + P(x)y=Q(x), y(x 0 )=y 0 

has a unique solution on the entire interval a<x<b. Furthermore, the point 
(x 0 ,y 0 ) can be any point of the strip, interior or not. This statement is a special 
case of the next theorem. 


Theorem B. Letf(x,y) be a continuous function that satisfies a Lipschitz condition 

\f(x,yd -f(x,y 2 )I <K\y 1 -y 2 \ 

on a strip defined by a <x<b and -°°<y<°°. If (x 0 ,i/ 0 ) is any point of the strip, then 
the initial value problem 


y’=f(x,y), y(x 0 )=y 0 (15) 

has one and only one solution y=y(x) on the interval a<x<b. 

Proof. The argument is similar to that given for Theorem A, with certain 
simplifications permitted by the fact that the region under discussion is not 
bounded above or below. In particular, we start the proof in the same way 
and show that the series (4)— and therefore the sequence (3)—is uniformly 
convergent on the whole interval a<x<b. We accomplish this by using a 
somewhat different method of estimating the terms of the series (10). 
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First, we define M 0 , M v and M by 

M 0 = |y 0 |, Mj^maxli/^x)!, M = M 0 + M 1 , 

and we notice that |y 0 (x)| and |yj(x) - y 0 (x)| <M. Next, if x 0 <x<b, it fol¬ 
lows that 


|y 2 (x)-yi(x) = 


X 

1 


{f[t,yft)]-f[t,y 0 (t)]}dt 


X 

< J \f[t,yi(t)\-f[t,y 0 (t)\\dt 

XQ 

X 

|y 1 (f)-y 0 (f)|df 

*0 

< KM(x-Xq), 


|y 3 (x)-y 2 (x)| = 


X 

j{f[t,y 2 (t)-f[t,yi(t)]}dt 


x 

<fcj | y 2 (t)-yi(t)\dt 

X0 

X 

< K 2 M J (t-x 0 )dt = K 2 M (x ~ 2 Yo) 

*0 


and in general 


|y„(x)-y n _ 1 (x)|<X"- 1 M (:t * o ) "/ ■ 

(n- 1)! 

The same argument is also valid for a<x<x 0 , provided only that x - x 0 is 
replaced by | x - x 0 1, so we have 


|y»(x)-y,i-i(x)| < K n l M- 


-*0 I 


(«-!)! 


< K n - 2 M 


(b~ar 

(n-l)\ 
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for every x in the interval and n = 1, 2,.... We conclude that each term of the 
series (10) is less than or equal to the corresponding term of the convergent 
series of constants 



so (3) converges uniformly on the interval a<x<b to a limit function y(x). 

Just as before, the uniformity of the convergence implies that y(x) is a solu¬ 
tion of (15) on the whole interval, and all that remains is to show that it is the 
only such solution. We assume that y(x) is also a solution of (15) on the inter¬ 
val. Our strategy is to show that y„(x) —> y(x) for each x as ;wco; and since we 
also have y„(x) -* y(x), it will follow that y(x) = y(x). We begin by observing 
that y(x) is continuous and satisfies the equation 


X 



If A = max \y(x)- y 0 1, then for x 0 <x<b we see that 


X 


I y (*) - i/i W I = | {fit, y(t)] - fit, yo(0]} dt 


X0 

X 


< J \ f[t,y(t)\-f{t,y 0 (t)\\dt 


x 



< KA(x-Xq), 


X 


13 fix) - W W |=| {fit, y{t)] - f[t, yft)]}dt 


X 


\y(t)-yft)\dt 



2 
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and in general 

\y(x)-y n (x)\<K"A { - X ^ X ^ / 

' n\ 

A similar result holds for a<x<x 0 so for any x in the interval we have 

|y(x)-y„(x)I< K n A < K n A ^~ a ^ . 

n\ n ! 

Since the right side of this approaches zero as « we conclude that 
y(x) = y(x) for every x in the interval, and the proof is complete. 


Problems 

1. Let (x 0 ,y 0 ) be an arbitrary point in the plane and consider the initial 
value problem 


y'=y * 1 2 3 4 ,y(xo) = y 0 . 

Explain why Theorem A guarantees that this problem has a unique 
solution on some interval | x — x 0 1 <h. Since/(x,y)=y 2 and df/dy = 2y are 
continuous on the entire plane, it is tempting to conclude that this solu¬ 
tion is valid for all x. By considering the solutions through the points 
(0,0) and (0,1), show that this conclusion is sometimes true and some¬ 
times false, and that therefore the inference is not legitimate. 

2. Show that/(x,y)=y 1/2 

(a) does not satisfy a Lipschitz condition on the rectangle |x| <1 and 
0<y<l; 

(b) does satisfy a Lipschitz condition on the rectangle |x| <1 and 
c<y<d, where 0<c<d. 

3. Show that/(x,y) = x 2 |y| satisfies a Lipschitz condition on the rectangle 
| x | <1 and | y | <1 but that df/dy fails to exist at many points of this 
rectangle. 

4. Show that/(x,y) = xy 2 

(a) satisfies a Lipschitz condition on any rectangle a<x<b and c<y<d; 

(b) does not satisfy a Lipschitz condition on any strip a<x<b> and 
—°° <y<°°. 
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5. Show that f(x,y) = xy 

(a) satisfies a Lipschitz condition on any rectangle a<x<b and c<y<d; 

(b) satisfies a Lipschitz condition on any strip a<x<b and < y < 

(c) does not satisfy a Lipschitz condition on the entire plane. 

6. Consider the initial value problem 

y'=y\y\ > y(*o)=y 0 - 

(a) For what points (x 0 ,y 0 ) does Theorem A imply that this problem has 
a unique solution on some interval \x - x 0 | <hl 

(b) For what points (x 0 ,y 0 ) does this problem actually have a unique 
solution on some interval | x - x 0 1 < hi 

7. For what points (x 0 ,y 0 ) does Theorem A imply that the initial value 
problem 

y'=y|y|/ y(*o)=y 0 

has a unique solution on some interval \x - x 0 | <hl 


71 Systems. The Second Order Linear Equation 

Picard's method of successive approximations can also be applied to sys¬ 
tems of first order equations. Let us consider, for example, the initial value 
problem consisting of the following pair of first order equations and initial 
conditions: 


7 ~ = f(x,y,z), y(x 0 ) = y 0 , 

ax 

-T- = g(x,y,z), z(x 0 ) = z 0 , 

dx 


( 1 ) 


where the right sides are continuous functions in some region of xyz space 
that contains the point (x 0 ,y 0 ,z 0 ). We use the differential notation here in order 
to emphasize that x is the independent variable. A solution of such a system 
is of course a pair of functions y = y(x) and z = z(x) which together satisfy the 
conditions imposed by (1) on some interval containing the point x 0 . As in 
the case of a single first order equation, it is apparent that the system (1) is 
equivalent to the system of integral equations 





The Existence and Uniqueness of Solutions 


639 


y{x) = l/o + J f[t,y(t),z(t)]dt, 

*0 

X 

z(x) = z 0 + 1 g[t,y(t),z(t)]dt, 

*0 


( 2 ) 


in the sense that the solutions of (1)—if any exist—are precisely the continu¬ 
ous solutions of (2). If we attempt to solve (2) by successive approximations 
beginning with the constant functions 

y 0 (x) = y 0 and z 0 (x)=z 0 , 

then the Picard method proceeds exactly as before. At the first stage we 
have 


X 

yi(x) = j/o + J f[t,y 0 {t),z 0 (t)]dt r 

XQ 

< 

X 

Zi(x) = z 0 +1 g[t,y 0 (t),z 0 (t)]dt; 
*0 

at the second stage we have 


iy 2 (x) = ij 0 + J f[t, yft), Zi(f)] dt, 

XQ 

X 

z 2 (x) = Z 0 + 1 g[t, yft), Zi(f)] dt; 

*0 


and so on. This procedure generates two sequences of functions y„(x) and 
z„(x); and under suitable hypotheses, the arguments of Theorem 69-A can 
easily be adapted to prove that these sequences converge to a solution of (1) 
which exists and is unique on some interval |x — x 0 1 <h. 

We now specialize to a linear system, in which the functions f(x,y,z) and 
g (x,y,z) in (1) are linear functions of y and z. That is, we consider an initial 
value problem of the form 
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= Pi(x)y + (ji(x)z + n(x), 
ax 

dz 

■ = Pi{x)ij + q 2 (x)z + r 2 (x), 

ax 


z(x 0 ) = z 0 , 


V(x o) = ijo, 


(3) 


where the six functions p,(x), q,(x), and rfx) are continuous on an interval 
a<x<b and x 0 is a point in this interval. Since each of these functions is 
bounded for a<x<b, there exists a constant K such that | p t (x) |< K and | q t (x) \<K 
for i= 1,2. It is now easy to see that the functions on the right sides of the dif¬ 
ferential equations in (3) satisfy Lipschitz conditions of the form 


|/(x,yi,Zi) -f(x,y 2 ,z 2 )\ <K(\y x - y 2 \ + |z x - z 2 |) 


and 


\g(x,yi,zf - g(x,y 2 ,z 2 )\ <K(\y x - y 2 \ + \z x - z 2 \). 


Just as in the proof of Theorem 69-B, these conditions can be used to show 
that (3) has a unique solution on the whole interval a<x<b. Again we spare 
the reader the details. 

These remarks about systems make it possible to give a simple proof of 
the following basic theorem, which we stated at the beginning of Chapter 3 
and which has played an unobtrusive but crucial role in all of our work on 
second order linear equations. 

Theorem A. Let P(x), Q(x), and R(x) be continuous functions on an interval a<x<b. 
If x 0 is any point in this interval, and y 0 and y' 0 are any numbers whatever, then the 
initial value problem 



has one and only one solution y = y(x) on the interval a<x<b. 

Proof. If we introduce the variable z-dy/dx, then it is clear that every solu¬ 
tion of (4) yields a solution of the linear system 



dx 


y{x o) = I/O, 


-f ; = -P(x)z-Q(x)y + R(x), z(x 0 ) = i/o. 


( 5 ) 
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and conversely. We have seen that (5) has a unique solution on the interval 
a < x < b, so the same is true of (4). 


Problem 

1. Solve the following initial value problem by Picard's method, and com¬ 
pare the result with the exact solution: 


' dy_ 
dx 
dz 



2/(0) = 1, 


z(0) = 0. 
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72 Introduction 

Despite the broad range of powerful analytical tools presented throughout 
this book, many occasions cry out for the application of numerical meth¬ 
ods for solving ordinary differential equations. For example, an exact solu¬ 
tion may be unavailable, or may be of little practical value. 1 This situation 
occurs when power series solutions to linear second order equations are 
constructed. In general, the series are rather good approximations near the 
initial condition, but the Taylor expansions can soon require prohibitively 
many terms should the solution be required at some large distance from that 
point. For large systems of equations, an exact solution may exist (in vec¬ 
tor form) but the subsequent algebraic manipulations may be overwhelm¬ 
ing. Furthermore, numerical solutions should not be cast in a light of last 
resort, for they form the mathematician's petri dish—a crucible in which he 
can conduct any number of experiments on his differential equation and, by 
proxy, the very thing he is trying to model. 2 

These numerical methods rely on two fundamental but distinct approx¬ 
imations. First, a differential equation is replaced with a difference 
equation and the role played by a continuous independent variable is 
then assumed by a discrete one. For this approach to be of any use, it is 


1 For a detailed historical account of the important role played by the application of numerical 
methods to differential equations, see Garrett Birkhoff's "Numerical Fluid Dynamics," the 
1981 John von Neumann Lecture, published in SIAM Review vol. 25, pp 1-34 (1983). 

2 In 1965, N. J. Zabusky and M. D. Kruskal discovered solitons in just this way. By considering 
a particular version of an equation governing the motion of surface water waves and exper¬ 
imenting with its numerical solution, they deduced the existence of mathematical objects 
with truly surprising properties. Solitons and the differential equations that govern their 
behavior have been one of the most intensely studied areas of applied mathematics during 
the last two decades. 
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important to understand the conditions under which the solution to the 
difference equation is close to, that is, converges to, the solution to the 
differential equation. Second, in virtually all digital computers in use 
today, the real-number line is approximated by a large but finite subset 
of rational numbers. Limiting oneself to only a finite range of rationals 
can have unobvious, but crucial, consequences in certain cases—the errors 
made by the machine may indeed be catastrophic. At any rate, both of 
these approximations permit the difference equations to be implemented 
on an enormous variety of computing hardware. Nevertheless, there are 
many apocryphal stories told of engineers performing expensive compu¬ 
tations on big computers only to obtain nonsense answers. We emphasize 
here that existence and uniqueness questions, discussed elsewhere in this 
book, are vitally important and should always be considered first. Beyond 
these, other problems, such as numerical instability and the existence of 
spurious solutions can cause difficulties. Despite the abundance of well- 
tuned algorithms for solving ordinary differential equations, the reader 
should carefully remark the need to be ever-vigilant. Before appealing to 
the machine for aid, it is always wise to know something about the answer 
one seeks. That is, the practicing scientist should endeavor to know as 
much about the solution as is possible. For example, is it bounded? Stable? 
Periodic? About how big (or small) should the answer be? Careful attention 
to these issues as discussed in the preceding chapters will stand the reader 
in good stead for what follows. 3 

In order to understand what we mean by a numerical solution of a differ¬ 
ential equation, we consider the simple initial-value problem 


y'=y,y{ o)=i. (i) 

The problem has the obvious solution y = e x , and for many theoretical pur¬ 
poses, this is enough. However, in a practical application it might be neces¬ 
sary to know the value of the solution when x = 0.5, and the decimal 1.649 is 
likely to be more useful than the symbol e 03 . In contrast to the theoretical 
solution of (1), a numerical solution can be provided by a table of values for e x 
or a pocket calculator. Either way, the number so obtained depended on our 
knowledge of the formula y = e x . 

In this chapter we describe several methods of calculating an approxima¬ 
tion numerical solution of the form 


y'=f{x,y), y(x 0 )=yo- 


( 2 ) 


3 For an excellent historical background on the evolution of numerical methods for differen¬ 
tial equations that occurred in the decades surrounding the development of the first digital 
computers, see Herman H. Goldstine, The Computer from Pascal to von Neumann, Princeton 
University Press, Princeton, 1972. 
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We shall assume that this problem has a unique solution denoted by y(x). 
Our methods consist of a computational procedures based solely on the 
information given by (2), and are completely independent of whether a for¬ 
mula for y(x) is known or not. These numerical methods and others like them 
are therefore extremely valuable for those initial-value problems that cannot 
be solved exactly, and also for those having exact formal solutions that are 
practically intractable. 4 

Let us be a little more specific about the nature of these methods. We shall 
not approximate the exact solution y(x) for all values of x in some interval, but 
only for a discrete sequence of points beginning at x 0 , say 

x 0 , x 1 -x 0 +h, x 2 -x l +h ,..., x n =x n _ x + h, 

where h is a positive number. This means that we want an approximation 
y 1 to the exact value y(x,), an approximation y 2 to the exact value y(x 2 ), 
and so on. Each numerical method we describe will be a rule for using y k 
to compute y k+2 . 5 Since we know the initial value y{x 0 ) = y 0 (this is exact), 
we can apply the rule with k = 0 to obtain y lr with n -1 to obtain y 2 , etc. 
Our general purpose is to apply enough of the details of each method to 
enable the reader to apply it for himself if the need should ever arise. We 
avoid details dealing with the plethora of computing machines and pro¬ 
gramming languages for several reasons. First, those issues are best left to 
specialized texts in numerical analysis. Second, it is our experience that 
virtually all students have some familiarity with computing fundamen¬ 
tals and should be able to write programs where appropriate to perform 
the calculations required by the exercises in this chapter. As to the means, 
that is better left to the student and his teacher. Third, advances in com¬ 
puting continue at a dizzying pace, and we see no need to burden this 
book with nonmathematical details that might well be obsolete in only a 
few short years. 

We shall illustrate our methods by applying them to the simple problem 

y'-x + y, y(0) = l, (3) 

which we call our benchmark problem. This differential equation in (3) is 
clearly linear, and the exact solution is easily found to be 

y- 2e x -x-l. (4) 


4 The noted American mathematician R. W. Hamming said that "the purpose of computing 
is insight, not numbers." Even so, it takes more than insight to build a skyscraper or a space 
shuttle. 

5 These are so-called single-step methods. There are also various multistep methods in which 
y t+1 depends not only on y k , but possibly on y k _ 1 and earlier terms. 
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We have chosen (3) as our benchmark problem for two reasons. First, it is so 
simple that a numerical method can be applied to it by hand without obscur¬ 
ing the main steps by a morass of computations. Second, the exact solution 
(4) can easily be evaluated for various x's with the aid of a pocket calculator, 
so we have a means of judging the accuracy of the approximate solutions 
produced by our numerical methods. 


Problem 

1. Have you encountered any examples in other courses where either the 
textbook or the instructor referred to numerical solutions of ordinary 
differential equations? Give an example and discuss what you read or 
heard. 


73 The Method of Euler 

If we integrate the differential equation in (2) from x 0 to x 1 - x Q + h, and use the 
initial condition y(x 0 ) = t/ 0 , we obtain 



or 



( 1 ) 


Since the unknown function y = y(x) occurs under the integral sign in (1), 
we can go no further without some sort of approximation to this integral. 
Different types of approximations correspond to various methods for 
numerically solving (2). 

The Euler method is obtained from the simplest way of approximating the 
integral in (5). It is worth considering because it paves the way for an under¬ 
standing of other more accurate but more complicated methods. The idea is 






Numerical Methods 


647 


to obtain y 1 —our approximation to 1 /( 1 ',)—by assuming that the integrand 
f(x,y) in (5) varies so little over the interval x 0 < x < x 1 that only a small error 
is made by replacing it by its value f(x 0 ,y 0 ) at the left endpoint. This is equiva¬ 
lent to replacing the integrand in (5) with its zeroth order Taylor polynomial, 
that is. 


f(x,y)^f{x 0 ,y 0 ) + R, 


( 2 ) 


where 


K(x) = I f%y(Q) +f y fe,y(Q)y'(Q](x - x 0 ), 

where R is the Taylor remainder term ,f y -df/dy and x 0 < ^ < x. Noting that 
y" =/' +f y y’, we substitute (2) into (5) to obtain 

3/i = Vo + hf{x Q ,y 0 ) + — y"©. 

We suppose that h 2 y"(Q/2 is "small" in an appropriate sense and neglect 
the term. How small is small in general, and more particularly, when this 
term is small are important issues that will be discussed in more detail 
later. (See Problem 6, Section 74, for a related discussion.) Neglecting this 
term, we have 


l/i = }/o+ hf(x 0 ,y 0 ), (3) 

We now continue and obtain y 2 from y, in the same way, by the formula 
y 2 = J/i + hf(x,y ); and in general we have 


yk + i=yk+hf(x k ,\ji). (4) 

for k = 0, 1, . . ., n. The geometric meaning of these formulas is shown in 
Figure 107, where the smooth curve is the unknown exact solution which 
is being approximated by the piecewise-linear curve generated constructed 
from (8). To understand this figure, remember that/(x 0 ,y 0 ) is the slope of the 
tangent line to the curve at the initial point (x 0 ,y 0 ). The point y, is found by 
constructing a line segment beginning at (x 0 ,y 0 ) with that slope and march¬ 
ing it in the positive x direction a distance of h. That point becomes the sec¬ 
ond approximation to the solution. The figure indicates the vertical distance 
between the solution and the approximation as the error at the first stage. 
An important quantity derived from this, is the total relative error E n at the 
nth step, defined to be 


£ _ | t/(x„) + y„ 

I !/(*») | 


( 5 ) 
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FIGURE 107 


This quantity is often expressed as a percentage, providing a comfortable 
way to gauge how accurately the numerical solution is performing. Now, 
using (x lr yf the process is repeated again to obtain the next point at (x 2 ,i/ 2 ), 
also shown in the figure. The geometric realization of the Euler method sug¬ 
gests that error can build up rather quickly, which is, in general, true. 

We illustrate the Euler method by applying it to the benchmark problem 
(3). We approximate the solution at the points x n = 0.2, 0.4, 0.6, 0.8, and 1.0 by 
using intervals of length h = 0.2. It is convenient to arrange the calculations 
as shown in Table 1. In the first line of this table, the initial condition y = 1 
when x = 0 determines the slope y'-x + y = 1.00. Since h = 0.2 and t/, = y 0 + hf 
( x 0 ,y 0 ), the next value is given by 1.00+ 0.2(1.00) = 1.20. This approximation 
is shifted to the y n in the second line and the process is repeated to find 
y 2 , which turns out to be 1.48. In the table (and most remaining examples). 


TABLE 1 

Tabulated Values for Exact and Numerical 
Solutions to (3) with h = 0.2 


Xn 

Vn 

Exact 

E„ (%) 

0.0 

1.00000 

1.00000 

0.0 

0.2 

1.20000 

1.24281 

3.4 

0.4 

1.48000 

1.58365 

6.5 

0.6 

1.85600 

2.04424 

9.2 

0.8 

2.34720 

2.65108 

11.5 

1.0 

2.97664 

3.43656 

13.4 
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TABLE 2 

Tabulated Values for Exact and Numerical 
Solutions to (3) with h = 0.1 


x n 

y» 

Exact 

E„ (%) 

0.0 

1.00000 

1.00000 

0.0 

0.1 

1.10000 

1.11034 

0.9 

0.2 

1.22000 

1.24281 

1.8 

0.3 

1.36200 

1.39972 

2.7 

0.4 

1.52820 

1.58365 

3.5 

0.5 

1.72102 

1.79744 

4.3 

0.6 

1.94312 

2.04424 

4.9 

0.7 

2.19743 

2.32751 

5.6 

0.8 

2.48718 

2.65108 

6.2 

0.9 

2.81590 

3.01921 

6.7 

1.0 

3.18748 

3.43656 

7.2 


we retain five figures after the decimal point, and the resulting approximate 
value of y(l) is 2.97664. The exact value found from (4) is 3.43656, so the error 
is about 13 percent. If we carry out a similar calculation with h = 0.1, then the 
resulting approximation for y(l) is 3.18748, and the error is reduced to about 
7 percent, roughly half of what it was in the first instance. Table 2 displays 
the intermediate results of the Euler method for the benchmark problem in 
this case. 

We can therefore improve the accuracy of the method by taking smaller 
values of h, but at the expense of more computational work. Even so, after a 
certain point, reducing the step size will only make errors worse as will be 
discussed in the next section. 


Problems 

For the following problems, use the Euler method with h = 0.1, 0.05, and 0.01 
to estimate the solution at x= 1. Compare your results to the exact solution in 
each instance and discuss how well (or badly!) the Euler method performs. 


1. y'=2x + 2y,y(0) = l. 

2. y' = l/y,y(0) = l. 

3. y'=eby(0) = 0. 

4. y' = y - sinx, y(0) = -l. 

5. y'=(x+y- l) * 1 2 3 4 5 ,y(0) = 0. 
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6. This problem illustrates the danger in blindly applying numerical meth¬ 
ods. Employ the Euler method to the following initial value problem: 

y' = sec 2 x, y(0) = 0. 

Use a step size of h = 0.1 and determine the numerical solution at x = l. 
Explain why the initial value problem has no solution at x = 1. 

7. Refer to Figure 107. From geometric arguments, for what kind of exact 
solutions might the Euler method give precise results? Do these results 
depend on h in any way? Construct two distinct examples to illustrate 
your ideas. 

8. The ordinary differential equation 

y'=y(i-y 2 ), 

possesses two equilibrium solutions: (|), = 0, which is unstable, and 
cj) 2 = 1, which is stable. With the initial condition i/(0) = 0.1, predict what 
should happen to the solution. Then, with h=0.1, use the Euler method 
to march the solution out until x = 3. What happens to the numerical 
solution? 


74 Errors 

The notion of error is of crucial importance in the study of numerical meth¬ 
ods and we will give the idea some special consideration here. We mentioned 
in the previous section that reducing the step size in the Euler method can 
be very costly. This occurs for two reasons. First, the number of computa¬ 
tions is directly proportional to the number of steps taken. Thus, raising the 
accuracy raises the computational cost. Secondly, a phenomenon known as 
round-off error can become important. This is a result of any computer's 
ability to represent only a finite subset of rational numbers. 


Example. Consider the benchmark problem (3). Let us examine what 
happens if h is made too small. Let us suppose that our calculator has 
nine decimal digits of precision. Let h = 10 -10 , a very small step size that 
would seem to yield very accurate answers. Applying the Euler method 
and computing the first step, we find that the calculator obtains 

yi = yoW(hd/o) = l + 10- 10 = l! (1) 

The last equality in (1) is not a misprint. Because of its limited precision 
ability, the calculator represents iq as exactly 1. Unfortunately, the same 
thing will happen to y, as well. In this instance, the Euler method would 
predict a constant solution to the test problem, and round-off error has 
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produced a numerical disaster. A detailed analysis of round-off error is 
beyond the scope of this text. 6 As a result, we will concentrate exclusively 
on discretization error in the rest of this chapter, assuming that round-off 
error is always negligible. 7 


The local discretization error at the nth step is defined to be £ n -y(xJ - y n . 
(This assumes that y n is exactly correct.) As shown in the previous section, 
for the Euler method, this quantity is given by 





( 2 ) 


where x k _ t < t, < x k . First, note that on the interval x 0 <x<x n , the quantity y "(x) 
is bounded by a positive constant M which is independent of h. Thus, |£ t .| < 
Mil 2 / 2. Reducing the step size by a factor of 2 reduces the error bound on the 
local discretization error by a factor of 4, for example. 

Unfortunately, the story is a bit more complicated than this, since there is 
nothing to prevent these local errors from accumulating as many steps are 
taken. This leads to the notion of total discretization error at the nth step, E n . 
To estimate this quantity, note that, as the numerical solution is marched 
from x 0 to x n , n steps are taken, and n = (x„ - x 0 )/h. Assuming the worst case, 
that is, that local errors always add together and never cancel, a heuristic 
bound for the total error can be obtained: 



= (x„-x 0 ) 


Mh 

2 


So, for the Euler method, the total discretization error is never greater than 
some constant times the step size. 

To illustrate these ideas, let us estimate the discretization errors associ¬ 
ated with the benchmark problem (3). First, note that y" = 2e x . It is easy to see 
that on 0 < x < 1, this quantity assumes its largest value at x = l. Thus, |G„| < 
eh 2 . The total error is bounded as well, with |£„| < eh. Referring to Table 1 in 
Section 73, with h = 0.2, the total discretization error at x = 1 is 0.46 (rounded 
to two decimal places). The error bound is e(0.2) = 0.54, and, as expected, the 
total error is less than the bound. With h = 0.1, the appropriate numbers can 
be obtained from Table 2 in Section 73. The total error is 0.25 while the error 
bound is 0.27. 

We close this section with some practical advice. Since, in many problems 
of concern, the exact solution is not available for calculating an error bound, 
how does one know when h is "small enough?" One way used in practice is 


6 But see Chapter 1 of R. L. Burden and J. D. Faires Numerical Analysis, 4th ed., PWS-Kent, 
Boston, 1989, for a very thorough discussion. 

7 Caveat computer. 
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to calculate the numerical solution several times, successively halving the 
step size h. When the results no longer change within the precision desired, 
it is a good, but not infallible, bet that h is small enough. By the same token, 
how can one check to see whether h is "too small," that is, that round-off 
error is not creeping into the problem. One technique is to repeat a calcula¬ 
tion using extended precision arithmetic. Most programming languages and 
most computers support this capability. When re-calculated with extended 
precision, if the numerical results change in any substantial way, it is almost 
a sure thing that serious round-off errors are occurring. Nevertheless, this 
test is not foolproof, for it is always possible that the errors will not be vis¬ 
ibly manifested even at extended precision. Never forget that, as powerful as 
computers and numerical methods are, they must be used with care. 


Problems 

For the following problems, use the exact solution, together with step sizes 
h = 0.2 and 0.1 to estimate the total discretization error that occurs with the 
Euler method at x = 1. 

1. y'=2x + 2y,y(0) = l. 

2. y' = l/y, y(0) = l. 

3. y'= e .v,y(0) = 0. 

4. y'=y - sinx, y(0) = -l. 

5. y' = (x + y - l) 2 , y(0) = 0. 

6. Consider the problem y' = sin 3jix, with y(0) = 0. Determine the exact 
solution and sketch the graph on the interval 0 < x < 1. Use the Euler 
method with h = 0.2 and h = 0.1 and sketch those results on the same 
axes. Discuss. Now, use the results in this section to calculate a step 
size sufficient to guarantee a total error of 0.01 at x = 1. Apply the Euler 
method with this step size, and compare with the exact solution. Why 
is this step size so small? 


75 An Improvement to Euler 

Errors of this magnitude (13 and 7 percent) are obviously unsatisfactory. 
They can be reduced considerably by using much smaller values of //, but 
this can have its hazards as discussed in Section 74 and a better approach 
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is to develop more accurate methods. For example, it is not unreasonable 
to expect an improvement if we approximate the integrand (5) by the aver¬ 
age of its values at the left and right endpoints of the interval, that is, by 
1 

— \f(x 0 ,y 0 ) +f(x 1 ,y(x 1 ))]. This is equivalent to using the trapezoidal rule for 
approximating the definite integral in (5). Making the substitution, we get 

h 

yi = yo + -[f(x 0 ,y 0 ) + f(x 1 ,y(x 1 ))\. (1) 

The difficulty with (1) is that y(x j) is unknown. Flowever, if we replace j/(x,) 
by its approximate value as found by the simpler Euler method, which we 
denote by z, = y 0 + hf(x 0 ,y 0 ), then (1) assumes the usable form 

h 

yi=yo+—[f( x o,yo)+f(x lr zi)\. (2) 


More generally. 


yk+i = y» + -[/(**, y*)+/(**+i,z*+i)]. 


( 3 ) 


where 


z M =y k +hf{x k ,y k ). (4) 

This method, usually called the improved Euler method or Fleun's 8 method, 
first predicts, then corrects an estimate for y k , it is a simple example of a class of 
numerical techniques called predictor-corrector methods. The local truncation 
error for this method can be shown to be £ k =-y'"(£ > )h 3 /12 with x k <^< x k ; as 
a result, the total truncation error is proportional to h 2 , and we expect more 
accuracy for the same step size. 

One way to visualize the improved Euler method is depicted in Figure 108. 
First, the point at (x lf z k ) is predicted using the Euler method. This point is 
used to estimate the slope of the solution curve at x v This is then averaged 
with the original slope estimate at (x 0 ,y 0 ) to make a better prediction of the 
solution, namely {x v y^). 


Karl Heun (1859-1929) was a contemporary of C. Runge and R. Kutta (q.v.). He made con¬ 
tributions to classical mechanics, the theory of special functions, and Gaussian quadrature 
methods. 
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FIGURE 108 


To see just how much improvement is obtained, let us apply (3) and (4) 
to our benchmark problem (3) with a step size of h = 0.2. These formulas 
become 


z k+ i=y k +0.2(x k +y^ 


and 


y*+i =Vk + +34) + (*/ 1 + 1 + Zk + i)\- 

To begin the calculations we set k = 0 and use the initial values x 0 = 0.0 and 
y 0 - 1.0000 to write 


Zi = 1.000 + 0 . 2 ( 0.0 + 1 . 000 ) = 1.200 


and 


\j x = 1.000 + 0.1[(0.0 +1.000) + (0.2 +1.2000)] -1.240. 

Table 1 shows the approximate values of the solution obtained at the points 
x n = 0.2,0.4,0.8, and 1.0 by continuing this process. The resulting approximate 
value for y(l) is 3.40542. The error with this method is therefore about 1 per¬ 
cent, which is a substantial improvement over the result obtained with the 
Euler method and the same step size. 

With a smaller step size, results are even better. Table 2 displays the results 
of applying the improved Euler method to (3) using a step size of h=0.1. The 
relative error at x = 1.0 has been decreased to about 0.2 percent, roughly a 
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TABLE 1 

Tabulated Values for Exact and Numerical Solutions to (3) with 
h = 0.2 Using the Improved Euler Method 


X„ 

Vn 

Exact 

E n (%) 

0.0 

1.00000 

1.00000 

0.00 

0.2 

1.24000 

1.24281 

0.23 

0.4 

1.57680 

1.58365 

0.43 

0.6 

2.03170 

2.04424 

0.61 

0.8 

2.63067 

2.65108 

0.77 

1.0 

3.40542 

3.43656 

0.91 


TABLE 2 

Tabulated Values for Exact and Numerical Solutions to (3) with 
h = 0.1 Using the Improved Euler Method 


x n 

y» 

Exact 

E„ (%) 

0.0 

1.00000 

1.00000 

0.0 

0.1 

1.11000 

1.11034 

0.0 

0.2 

1.24205 

1.24281 

0.1 

0.3 

1.39847 

1.39972 

0.1 

0.4 

1.58180 

1.58365 

0.1 

0.5 

1.79489 

1.79744 

0.1 

0.6 

2.04086 

2.04424 

0.2 

0.7 

2.32315 

2.32751 

0.2 

0.8 

2.64558 

2.65108 

0.2 

0.9 

3.01236 

3.01921 

0.2 

1.0 

3.42816 

3.43656 

0.2 


fourth of that found previously. Since the total discretization error is propor¬ 
tional h 2 , halving the step size leads to the result indicated above. 

Clearly, there is a substantial improvement in the accuracy of the improved 
Euler method at a rather modest increase in the complexity of the formula. 
Suppose, however, that even more accuracy is desired. Decreasing the step 
size will work, though, as with the Euler method, it takes longer and will 
eventually produce unacceptably large errors. There are two main directions 
in which the strategy of increasing accuracy can be pursued. Perhaps the 
most natural one is to consider more accurate approximations to the inte¬ 
grand in (5). There are two fundamental ways in which this can be done: by 
using a polynomial approximant for/(x,y) in the interval [x 0 ,Xj] or by subdi¬ 
viding the interval. The latter method gives rise to the Runge-Kutta meth¬ 
ods, which will be described in the next section. The former approach leads 
to the multiterm Taylor methods, one of which we briefly describe below. 
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First, we determine the first order Taylor polynomial for f(x,y) about the 
point x = x 0 \ 


f(x,y) =f(x 0 ,y 0 ) + [f\x,y) +f y (x,y)i/)(x -x 0 ). 

We then substitute this into (5) to obtain the three-term Taylor scheme: 


yk+i - yk + hf( x o/yo)+ 



( 5 ) 


where we have used the fact that y" = [f(x,y)\. The local truncation error is 
€ k =y'"(<f)h 3 /12 where x 0 < qx n . The total truncation error is proportional to h 2 . 
Consequently, (5) is expected to perform comparably to (3). 

Table 3 displays the results of applying (5) to (3) with h = 0.1. At x = l, 
this method produces results identical (to the number of decimal places 
shown) to those obtained with the improved Euler method. Obviously, bet¬ 
ter accuracy can be obtained by retaining more terms in the Taylor series 
(see Problem 8). The drawback to this approach comes from the need to 
evaluate higher-order derivatives of f(x,y). These derivatives can become 
unwieldy in a hurry, slowing down the calculation time for a given problem 
significantly. Even more,/(.r,iy) may not be available in analytical form. For 
example, it could consist of discrete experimental data or itself might be the 
result of a numerical computation. As such, higher order derivative calcula¬ 
tions are likely to be so inaccurate as to nullify any gain that might exist 
in principle. Thus, multiterm Taylor methods are seldom used in practice. 


TABLE 3 


Tabulated Values for Exact and Numerical Solutions to (3) with 
h = 0.1 Using the Three-Term Taylor Method 


x„ 

Vn 

Exact 

E„ (%) 

0.0 

1.00000 

1.00000 

0.0 

0.1 

1.11000 

1.11034 

0.0 

0.2 

1.24205 

1.24281 

0.1 

0.3 

1.39847 

1.39972 

0.1 

0.4 

1.58180 

1.58365 

0.1 

0.5 

1.79489 

1.79744 

0.1 

0.6 

2.04086 

2.04424 

0.2 

0.7 

2.32315 

2.32751 

0.2 

0.8 

2.64558 

2.65108 

0.2 

0.9 

3.01236 

3.01921 

0.2 

1.0 

3.42816 

3.43656 

0.2 
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There exist much better ways to gain the accuracy needed with far less 
computational cost, as will be discussed in the next section. 


Problems 

For the following problems, use the improved Euler method with h = 0.1,0.05, 
and 0.01 to estimate the solution at x = l. Compare your results to the exact 
solution and the results obtained with the Euler method in Section 73. 

1. y'=2x + 2y,y(0) = l. 

2. y' = l/y,y(0) = l. 

3. y'= e .v,y(0) = 0. 

4. y' =y - sinx, y(0) = -l. 

5. y' = (pc + y- l) 2 ,y(0)=0. 

6. Think of some examples for which the three-term Taylor method 
might work better than the improved Euler method. In each instance, 
describe why and, if possible, use a computer or calculator to illustrate 
the problem. 

7. Think of some examples for which the three-term Taylor method might 
work poorly. In each instance, describe the source of difficulty. If pos¬ 
sible, use a computer or calculator to illustrate the problem. 

8. Derive an expression for the four-term Taylor method. Apply it to the 
benchmark problem (3) with a step size of h = 0.1 and calculate the 
solution out to x = l. Is any accuracy gained over the three-term Taylor 
method? 


76 Higher Order Methods 

As with the improved Euler methods discussed in Section 75, the Runge- 
Kutta 9 methods can be derived from (5) by using a different approximation 
for the integral. Let us consider Simpson's rule. In this instance. 


9 Carl Runge (1856-1927) was professor of applied mathematics at Gdttingen from 1904 to 1925. 
He is known for his work on the Zeeman effect and for his discovery of a theorem that fore¬ 
shadowed the famous Thue-Siegel-Roth theorem in Diophantine equations. He also taught 
Hilbert to ski. M. W. Kutta (1867-1944), another German applied mathematician, is remem¬ 
bered for his contribution to the Kutta-Joukowski theory of airfoil lift in aerodynamics. 
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J f{x, y)dx = 1 [f(x 0 ,y 0 ) + 4 f(xy 2 , y(x V2 )) + f(x,,y{x,))], (1) 


where x 1/2 -x 0 + h/2. A rigorous derivation of the fourth order Runge-Kutta 
method is beyond the scope of this chapter. Rather than simply state the 
results, we give here an intuitive development of this extremely important 
scheme for solving ordinary differential equations. 10 

In much the same way as we applied the other integration formulas, we 
must make estimates of both y 1/2 and y v The first estimate of y 1/2 is obtained 
from Euler's method: 


yi/2 - yo + —, (2) 

where m l = hf(x 0 ,y 0 ). The factor of 1/2 is necessary since the step size from x 0 
to x 1/2 is h/1. To correct this estimate of y 1/2 , we calculate it again in the fol¬ 
lowing way: 


y 1/2 - yo + —/ ( 3 ) 

where now m 2 = hf(x 0 + h/ 2, yo + nq/2). Now, to predict y 1 we use this latter 
estimate for y 1/2 and the Euler method: 


iit-A , -<. 

yi =yi/2 + —, ( 4 ) 

where now m 3 =hf(x 0 + h/2, y 0 + m 2 /2). Finally, we let m A =hf{x+h, y 0 +m 3 ). The 
Runge-Kutta method is then obtained from substituting each of these esti¬ 
mates into (1) to obtain 


Vi = y 0 + 7(^1 + 2 m 2 +2 m 3 +m A ). 
o 


( 5 ) 


As with all previous methods, this one can be extended to any number of 
mesh points in the natural way. At each step, first compute the four numbers 
m v .. ., m A : 


10 It is worth noting that more than one fourth order Runge-Kutta formula can be derived. See 
B. Carnahan, H. A. Luther, and J. O. Wilkes, Applied Numerical Methods, Wiley, New York, 
1969, pp. 361-363, for a short, but interesting, historical discussion of this point. 
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m 3 =hf\x jt + |, y k + 
m A = hf(x k + h, y k + m 3 ). 



Then, y fc+1 is given by 



( 6 ) 


This powerful method is capable of giving accurate results without taking 
h so small that computational labor becomes excessive or that numerical 
round-off becomes a serious problem. The local truncation error is C k = -y''(q) 
/i 5 /180 where x 0 < x < x n and the total truncation error is proportional to h 4 . 
This is one reason for its remarkable accuracy. 

We now apply (6) to approximate y(l) in our benchmarks problem (3). With 
h = 1, so that only a single step is required, we have 


m, = 1(0 + 1) = !, 


1(0+ 0.5+ 1+0.5)-2, 


l(0 + 0.5 + l + l) = 2.5. 


1(0 + 1+ 1+2.5)-4.5, 


so that 


y x = 1 +1(1 + 4 + 5 + 4.5) = 3.417. 
6 


This approximation is even better than the improved Euler method with 
h = 0.2! In Table 1, we show the result of applying the Runge-Kutta method to 
our benchmark problem with h = 0.2. Note especially that our approximate 
value for y(l) is 3.43650, which agrees with the exact value to four figures 
after the decimal point. The relative error is much smaller, in this case less 
than 0.2%. Halving the step size produces even better results, as shown in 
Table 2. With h- 0.1, the exact and computed solutions agree exactly to the 
number of decimal places shown, and the relative error at the end of the cal¬ 
culation is now less than 0.02 percent, a very nice result indeed! 
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TABLE 1 

Tabulated Values for Exact and Numerical Solutions to (3) 
with h = 0.2 Using the Runge-Kutta Method 



y« 

Exact 

E„ (%) 

0.0 

1.00000 

1.00000 

0.00000 

0.2 

1.24280 

1.24281 

0.00044 

0.4 

1.58364 

1.58365 

0.00085 

0.6 

2.04421 

2.04424 

0.00125 

0.8 

2.65104 

2.65108 

0.00152 

1.0 

3.43650 

3.43656 

0.00179 


TABLE 2 

Tabulated Values for Exact and Numerical Solutions to (3) 
with h = 0.1 Using the Runge-Kutta Method 




Exact 

E„ (%) 

0.0 

1.00000 

1.00000 

0.00000 

0.1 

1.11034 

1.11034 

0.00002 

0.2 

1.24281 

1.24281 

0.00003 

0.3 

1.39972 

1.39972 

0.00004 

0.4 

1.58365 

1.58365 

0.00006 

0.5 

1.79744 

1.79744 

0.00007 

0.6 

2.04424 

2.04424 

0.00008 

0.7 

2.32750 

2.32751 

0.00009 

0.8 

2.65108 

2.65108 

0.00010 

0.9 

3.01920 

3.01921 

0.00011 

1.0 

3.43656 

3.43656 

0.00012 


Problems 

For the following problems, use the Runge-Kutta method with h = 0.1, 0.05, 
and 0.01 to estimate the solution at x = l. Compare your results to the exact 
solution and the results obtained with both the Euler method in Section 73 
and the improved Euler method in Section 75. 


1. y' =2x + 2y, y(0) = 1. 

2. y' = \/y, i/(0) = 1. 

3. y' = e y , y(0) = 0. 

4. y' =y - sinx, y(0) = -l. 

5. y' -(x + y - l) * 1 2 3 4 5 , y(0) = 0. 
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6. Are there any other numerical integration rules that could be used to 
generate methods as accurate as the Runge-Kutta method or more so? 
Find one and attempt to work out the steps necessary for an algorithm. 
Check your results against the benchmark problem and discuss your 
findings. 

7. Use the Runge-Kutta method with h = 0.2 and solve the following 
equation 

fy-3fy' + 3y=l,y(l) = 0,y'(l) = 0. 

Determine the exact solution and compare your results. Does the dif¬ 
ferential equation possess a solution at f = 0? How might the Runge- 
Kutta method be employed to compute the solution there? 


77 Systems 

Heretofore our numerical methods have been employed against first order 
initial-value problems. It should be clear that many important physical prob¬ 
lems are modeled by second and higher order equations (such as vibrat¬ 
ing mechanical systems), or even directly as systems of equations (such as 
predator-prey systems). It is therefore natural to seek ways in which our 
methods can be extended to treat these types of problems. 

Since d 2 y/dt 2 =f(t,y,dy/dt) can be transformed into the system of first order 
equations dy/dt = x and dx/dt=f(t,y,x), it is customary to transform all higher 
order differential equations into systems of first order equations. In this sec¬ 
tion we will discuss formulas that explicitly treat systems of two first order 
equations, but the results can be generalized to more equations with relative 
ease. It should be noted that serious scientific and engineering applications, 
employing models composed of complicated systems of differential equa¬ 
tions, are almost always solved with methods (albeit with a bit more sophis¬ 
tication) very much like the ones we will describe here. 

Our objective is to formulate methods for generating numerical solutions 
to the following system of equations: 

x'=f{t,x,y), (1) 

y' =g(t,x,y), (2) 

with initial conditions 

x(t 0 ) = x 0 , y(i 0 ) = l/o- (3) 

We assume, of course, that the functions / and g are sufficiently smooth so 
that unique solutions to (1), (2), and (3) exist. 11 As in the previous sections, we 


11 See Chapter 11. 
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seek to construct approximate solutions x n and y n to the system at the points 
t=to, f = t 0 +h,.. ., t n =t 0 +nh. 

The Euler method takes on an entirely analogous form for this case and is 
given below: 


Xjt+ 1 =x k +hf(t k/ x k ,yj, 

(4) 

y k+ i=yk+h8(tk,x k ,y^ 

(5) 


where k = 0,11. The expression for the local truncation error is more 
complicated for the Euler method in this instance, but it remains true that the 
total discretization error is proportional to h. 

Consider the following linear, second order, nonhomogeneous differential 
equation: 


dyf 

dt 2 


+ 4 y = cos t 


( 6 ) 


with initial conditions y(0) = y'(0) = 0. Equation (6) can be thought of as a 
model for an undamped spring-mass subject to a sinusoidal exterior driving 
force. At time t = 0, the mass lies at its equilibrium position with no initial 
velocity. The exact solution to (6) is 


1 

y = —(cos t - cos 2t). 

Cast into system form, we first let y' = x. Then 


-4y + cos f. 

(7) 

y'=x. 

(8) 


with initial conditions x(0) = y(0) = 0. Table 1 contains the tabulated results 12 
for this system on the interval 0 < t < 1 using the Euler method with h = 0.1 
Note that the relative error for y starts out extremely large, decreases to a 
rather small value, and then begins to increase again. See Problem 5 for a 
discussion of this phenomenon. 


12 This tabulation should convince anyone (should such convincing be needed) trying such a 
calculation by hand that there is nothing like a computer, together with a good program¬ 
ming language, for accomplishing such a task. Imagine what it was like in the old days (pre- 
World War II), when virtually all engineering computations were done with a pencil, paper, 
and perhaps a desk calculator. 
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TABLE 1 

Tabulated Values for Exact and Numerical Solutions to (7) 
and (8) with h = 0.1 Using the Euler Method 


t„ 

x„ 

Vn 

Exact x 

Exact y 

E„ for y (%) 

0.0 

0.00000 

0.00000 

0.00000 

0.00000 

— 

0.1 

0.10000 

0.00000 

0.09917 

0.00498 

100 

0.2 

0.19950 

0.01000 

0.19339 

0.01967 

49 

0.3 

0.29351 

0.02995 

0.27792 

0.04333 

31 

0.4 

0.37706 

0.05930 

0.34843 

0.07478 

21 

0.5 

0.44545 

0.09701 

0.40117 

0.11243 

14 

0.6 

0.49440 

0.14155 

0.43315 

0.15433 

8.3 

0.7 

0.52032 

0.19099 

0.44223 

0.19829 

3.7 

0.8 

0.52040 

0.24302 

0.42726 

0.24197 

0.4 

0.9 

0.49286 

0.29506 

0.38812 

0.28294 

4.3 

1.0 

0.43700 

0.34435 

0.32571 

0.31882 

8.0 


The Runge-Kutta method for this system is 

X k+ i = X k + h \% + \l k2 + 1% + M, 

6 

Vk + i = Vk + ](v k : + v k2 + v k3 + v u ), 

6 

where 

fhi = hf(t k ,x k ,y k ), 

v k i = hg(t k ,x k ,y k ), 

fh2 = hf^t k + — , x k + ^j~, y k + 

hfc 3 =hf^t k +—, x k + ^y", y^ + ^j, 
v k 3 = +—, x k + y ^/ yt + 


(9) 

( 10 ) 


h/t4 — hf(t k + h,x k + 143/y/c + ^t3)/ 
hu = hg(t k + h,x k + 14 3 , y* + 0*3). 
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TABLE 2 

Tabulated Values for Exact and Numerical Solutions to (7) and 
(8) with h = 0.1 Using the Runge-Kutta Method 


t„ 

x„ 

y„ 

Exact x 

Exact y 

E„ for y (%) 

0.0 

0.00000 

0.00000 

0.00000 

0.00000 

— 

0.1 

0.09917 

0.00498 

0.09917 

0.00498 

0.0006 

0.2 

0.19339 

0.01967 

0.19339 

0.01967 

0.0018 

0.3 

0.27792 

0.04333 

0.27792 

0.04333 

0.0022 

0.4 

0.34843 

0.07478 

0.34843 

0.07478 

0.0023 

0.5 

0.40117 

0.11242 

0.40117 

0.11243 

0.0024 

0.6 

0.43314 

0.15432 

0.43315 

0.15433 

0.0024 

0.7 

0.44223 

0.19829 

0.44223 

0.19829 

0.0024 

0.8 

0.42726 

0.24196 

0.42726 

0.24197 

0.0023 

0.9 

0.38813 

0.28293 

0.38812 

0.28294 

0.0022 

1.0 

0.32571 

0.31881 

0.32571 

0.31882 

0.0021 


The total discretization error for this more general Runge-Kutta method 
remains proportional to h * 1 2 3 4 . The numerical solution of (7) and (8) with a step 
size of 7z = 0.1 is displayed in Table 2. Note that the relative error is signifi¬ 
cantly smaller than that seen with the Euler method as shown in Table 1, and 
furthermore, the relative error does not exhibit the same degree of fluctua¬ 
tion as that case. 


Problems 

1. Use the Euler method, with step size h = 0.2 to evaluate the solution to 
y" - y= 0, i/(0) = 0, y'(l) = 0 at t = 0.2 and r = 0.4. Compare your results to 
the exact solution. 

2. Use the Euler method, with step size h = 0.1 to evaluate the solution to 
the following system of equations at t = 0.5: 

x'=y' 

y' = x(l - x), 

withx(0) = y(0) = l. 

3. Use the Runge-Kutta method (and a computer!) to evaluate the solution 

toy" - y( 1 - y)y' + y= 0,y(0) = 1 andy'(0) = 1, at t = 1. Use step sizes of 0.5, 
0.2, and 0.1. 
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4. Generalize the formulation of the Euler method to a system of three first 
order ordinary differential equations. 

5. Using the results listed in Table 1, sketch the graph of y„ and y versus f„. 
Explain the fluctuation in the relative error. Does the same error behav¬ 
ior occur for x„ and x? Why does the Runge-Kutta error (see Table 2) not 
behave this way? 


Taylor & Francis 

Taylor & Francis Group 

http://taylorandfrancis.com 
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TABLE 1 


Trigonometric Functions 

Angle Angle 


Degree 

Radian 

Sine 

Cosine 

Tangent 

Degree 

Radian 

Sine 

Cosine 

Tangent 

0 ° 

0.000 

0.000 

1.000 

0.000 






1 ° 

0.017 

0.017 

1.000 

0.017 

32° 

0.559 

0.530 

0.848 

0.625 

2 ° 

0.035 

0.035 

0.999 

0.035 

33° 

0.576 

0.545 

0.839 

0.649 

3° 

0.052 

0.052 

0.999 

0.052 

34° 

0.593 

0.559 

0.829 

0.675 

4° 

0.070 

0.070 

0.998 

0.070 

35° 

0.611 

0.574 

0.819 

0.700 

5° 

0.087 

0.087 

0.996 

0.087 

36° 

0.628 

0.588 

0.809 

0.727 

6 ° 

0.105 

0.105 

0.995 

0.105 

37° 

0.646 

0.602 

0.799 

0.754 

7° 

0.122 

0.122 

0.993 

0.123 

38° 

0.663 

0.616 

0.788 

0.781 

8 ° 

0.140 

0.139 

0.990 

0.141 

39° 

0.681 

0.629 

0.777 

0.810 

9° 

0.157 

0.156 

0.988 

0.158 

40° 

0.698 

0.643 

0.766 

0.839 

10 ° 

0.175 

0.174 

0.985 

0.176 

41° 

0.716 

0.656 

0.755 

0.869 

11 ° 

0.192 

0.191 

0.982 

0.194 

42= 

0.733 

0.669 

0.743 

0.900 

12 ° 

0.209 

0.208 

0.978 

0.213 

43° 

0.750 

0.682 

0.731 

0.933 

13° 

0.227 

0.225 

0.974 

0.231 

44° 

0.768 

0.695 

0.719 

0.966 

14° 

0.244 

0.242 

0.970 

0.249 

45° 

0.785 

0.707 

0.707 

1.000 

15° 

0.262 

0.259 

0.966 

0.268 

46° 

0.803 

0.719 

0.695 

1.036 

16° 

0.279 

0.276 

0.961 

0.287 

47 = 

0.820 

0.731 

0.682 

1.072 

17° 

0.297 

0.292 

0.956 

0.306 

48° 

0.838 

0.743 

0.669 

1.111 

18° 

0.314 

0.309 

0.951 

0.325 

49° 

0.855 

0.755 

0.656 

1.150 

19° 

0.332 

0.326 

0.946 

0.344 

50° 

0.873 

0.766 

0.643 

1.192 

o 

O 

<N 

0.349 

0.342 

0.940 

0.364 

51° 

0.890 

0.777 

0.629 

1.235 

21 ° 

0.367 

0.358 

0.934 

0.384 

52° 

0.908 

0.788 

0.616 

1.280 

22 ° 

0.384 

0.375 

0.927 

0.404 

53° 

0.925 

0.799 

0.602 

1.327 

23° 

0.401 

0.391 

0.921 

0.424 

54° 

0.942 

0.809 

0.588 

1.376 

24“ 

0.419 

0.407 

0.914 

0.445 

55° 

0.960 

0.819 

0.574 

1.428 

25° 

0.436 

0.423 

0.906 

0.466 

56° 

0.977 

0.829 

0.559 

1.483 

26° 

0.454 

0.438 

0.899 

0.488 

57° 

0.995 

0.839 

0.545 

1.540 

27° 

0.471 

0.454 

0.891 

0.510 

58° 

1.012 

0.848 

0.530 

1.600 

28° 

0.489 

0.469 

0.883 

0.532 

59° 

1.030 

0.857 

0.515 

1.664 

29° 

0.506 

0.485 

0.875 

0.554 

60° 

1.047 

0.866 

0.500 

1.732 

30° 

0.524 

0.500 

0.866 

0.577 

61° 

1.065 

0.875 

0.485 

1.804 

31° 

0.541 

0.515 

0.857 

0.601 

62° 

1.082 

0.883 

0.469 

1.881 


(' Continued ) 
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TABLE 1 ( Continued ) 

Trigonometric Functions 


Angle 

Sine 

Cosine 

Tangent 

Angle 

Sine 

Cosine 

Tangent 

Degree Radian 

Degree 

Radian 

63° 

1.100 

0.891 

0.454 

1.963 

77° 

1.344 

0.974 

0.225 

4.332 

64° 

1.117 

0.899 

0.438 

2.050 

78° 

1.361 

0.978 

0.208 

4.705 

65° 

1.134 

0.906 

0.423 

2.145 

79° 

1.379 

0.982 

0.191 

5.145 

66° 

1.152 

0.914 

0.407 

2.246 

80° 

1.396 

0.985 

0.174 

5.671 

67° 

1.169 

0.921 

0.391 

2.356 

81° 

1.414 

0.988 

0.156 

6.314 

o 

00 

VO 

1.187 

0.927 

0.375 

2.475 

82° 

1.431 

0.990 

0.139 

7.115 

69° 

1.204 

0.934 

0.358 

2.605 

83° 

1.449 

0.993 

0.122 

8.144 

o 

o 

1.222 

0.940 

0.342 

2.748 

84° 

1.466 

0.995 

0.105 

9.514 

71° 

1.239 

0.946 

0.326 

2.904 

85° 

1.484 

0.996 

0.087 

11.43 

72° 

1.257 

0.951 

0.309 

3.078 

86° 

1.501 

0.998 

0.070 

14.30 

73° 

1.274 

0.956 

0.292 

3.271 

87° 

1.518 

0.999 

0.052 

19.08 

74° 

1.292 

0.961 

0.276 

3.487 

88° 

1.536 

0.999 

0.035 

28.64 

75° 

1.309 

0.966 

0.259 

3.732 

89° 

1.553 

1.000 

0.017 

57.29 

76° 

1.326 

0.970 

0.242 

4.011 

90° 

1.571 

1.000 

0.000 
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TABLE 2 


Exponential Functions 


X 

e x 

e~ x 

X 

e x 

e~ x 

0.00 

1.0000 

1.0000 

2.5 

12.182 

0.0821 

0.05 

1.0513 

0.9512 

2.6 

13.464 

0.0743 

0.10 

1.1052 

0.9048 

2.7 

14.880 

0.0672 

0.15 

1.1618 

0.8607 

2.8 

16.445 

0.0608 

0.20 

1.2214 

0.8187 

2.9 

18.174 

0.0550 

0.25 

1.2840 

0.7788 

3.0 

20.086 

0.0498 

0.30 

1.3499 

0.7408 

3.1 

22.198 

0.0450 

0.35 

1.4191 

0.7047 

3.2 

24.533 

0.0408 

0.40 

1.4918 

0.6703 

3.3 

27.113 

0.0369 

0.45 

1.5683 

0.6376 

3.4 

29.964 

0.0334 

0.50 

1.6487 

0.6065 

3.5 

33.115 

0.0302 

0.55 

1.7333 

0.5769 

3.6 

36.598 

0.0273 

0.60 

1.8221 

0.5488 

3.7 

40.447 

0.0247 

0.65 

1.9155 

0.5220 

3.8 

44.701 

0.0224 

0.70 

2.0138 

0.4966 

3.9 

49.402 

0.0202 

0.75 

2.1170 

0.4724 

4.0 

54.598 

0.0183 

0.80 

2.2255 

0.4493 

4.1 

60.340 

0.0166 

0.85 

2.3396 

0.4274 

4.2 

66.686 

0.0150 

0.90 

2.4596 

0.4066 

4.3 

73.700 

0.0136 

0.95 

2.5857 

0.3867 

4.4 

81.451 

0.0123 

1.0 

2.7183 

0.3679 

4.5 

90.017 

0.0111 

1.1 

3.0042 

0.3329 

4.6 

99.484 

0.0101 

1.2 

3.3201 

0.3012 

4.7 

109.95 

0.0091 

1.3 

3.6693 

0.2725 

4.8 

121.51 

0.0082 

1.4 

4.0552 

0.2466 

4.9 

134.29 

0.0074 

1.5 

4.4817 

0.2231 

5 

148.41 

0.0067 

1.6 

4.9530 

0.2019 

6 

403.43 

0.0025 

1.7 

5.4739 

0.1827 

7 

1096.6 

0.0009 

1.8 

6.0496 

0.1653 

8 

2981.0 

0.0003 

1.9 

6.6859 

0.1496 

9 

8103.1 

0.0001 

2.0 

7.3891 

0.1353 

10 

22026 

0.00005 

2.1 

8.1662 

0.1225 




2.2 

9.0250 

0.1108 




2.3 

9.9742 

0.1003 




2.4 

11.023 

0.0907 
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TABLE 3 

Natural Logarithms (In x = log p x) 

This table contains logarithms of numbers from 1 to 10 to the base e. To obtain the natural 
logarithms of other numbers use the formulas: 

ln(10 r x) = ln.r + lnlO r lnf-^j-j = In x — In 10 ' 

In 10 = 2.302585 In 10 2 = 4.605170 In 10 3 = 6.907755 

In 10 4 = 9.210340 In 10 5 = 11.512925 In 10 6 = 13.815511 

X 0 123456789 


1.0 

0.0 

0000 

0995 

1980 

2956 

3922 

4879 

5827 

6766 

7696 

8618 

1.1 

0.0 

9531 

*0436 

*1333 

*2222 

*3103 

*3976 

*4842 

*5700 

*6551 

*7395 

1.2 

0.1 

8232 

9062 

9885 

*0701 

*1511 

*2314 

*3111 

*3902 

*4686 

*5464 

1.3 

0.2 

6236 

7003 

7763 

8518 

9267 

*0010 

*0748 

*1481 

*2208 

*2930 

1.4 

0.3 

3647 

4359 

5066 

5767 

6464 

7156 

7844 

8526 

9204 

9878 

1.5 

0.4 

0547 

1211 

1871 

2527 

3178 

3825 

4469 

5108 

5742 

6373 

1.6 

0.4 

7000 

7623 

8243 

8858 

9470 

*0078 

*0682 

*1282 

*1879 

*2473 

1.7 

0.5 

3063 

3649 

4232 

4812 

5389 

5962 

6531 

7098 

7661 

8222 

1.8 

0.5 

8779 

9333 

9884 

*0432 

*0977 

*1519 

*2078 

*2594 

*3127 

*3658 

1.9 

0.6 

4185 

4710 

5233 

5752 

6269 

6783 

7294 

7803 

8310 

8813 

2.0 

0.6 

9315 

9813 

*0310 

*0804 

*1295 

*1784 

*2271 

*2755 

*3237 

*3716 

2.1 

0.7 

4194 

4669 

5142 

5612 

6081 

6547 

7011 

7473 

7932 

8390 

2.2 

0.7 

8846 

9299 

9751 

*0200 

*0648 

*1093 

*1536 

*1978 

*2418 

*2855 

2.3 

0.8 

3291 

3725 

4157 

4587 

5015 

5442 

5866 

6289 

6710 

7129 

2.4 

0.8 

7547 

7963 

8377 

8789 

9200 

9609 

*0016 

*0422 

*0826 

*1228 

2.5 

0.9 

1629 

2028 

2426 

2822 

3216 

3609 

4001 

4391 

4779 

5166 

2.6 

0.9 

5551 

5935 

6317 

6698 

7078 

7456 

7833 

8208 

8582 

8954 

2.7 

0.9 

9325 

9695 

*0063 

*0430 

*0796 

*1160 

*1523 

*1885 

*2245 

*2604 

2.8 

1.0 

2962 

3318 

3674 

4028 

4380 

4732 

5082 

5431 

5779 

6126 

2.9 

1.0 

6471 

6815 

7158 

7500 

7841 

8181 

8519 

8856 

9192 

9527 

3.0 

1.0 

9861 

*0194 

*0526 

*0856 

*1186 

*1514 

*1841 

*2168 

*2493 

*2817 

3.1 

1.1 

3140 

3462 

3783 

4103 

4422 

4740 

5057 

5373 

5688 

6002 

3.2 

1.1 

6315 

6627 

6938 

7248 

7557 

7865 

8173 

8479 

8784 

9089 

3.3 

1.1 

9392 

9695 

9996 

*0297 

*0597 

*0896 

*1194 

*1491 

*1788 

*2083 

3.4 

1.2 

2378 

2671 

2964 

3256 

3547 

3837 

4127 

4415 

4703 

4990 

3.5 

1.2 

5276 

5562 

5846 

6130 

6413 

6695 

6976 

7257 

7536 

7815 

3.6 

1.2 

8093 

8371 

8647 

8923 

9198 

9473 

9746 

*0019 

*0291 

*0563 

3.7 

1.3 

0833 

1103 

1372 

1641 

1909 

2176 

2442 

2708 

2972 

3237 

3.8 

1.3 

3500 

3763 

4025 

4286 

4547 

4807 

5067 

5325 

5584 

5841 

3.9 

1.3 

6098 

6354 

6609 

6864 

7118 

7372 

7624 

7877 

8128 

8379 

4.0 

1.3 

8629 

8879 

9128 

9377 

9624 

9872 

*0118 

*0364 

*0610 

*0854 

4.1 

1.4 

1099 

1342 

1585 

1828 

2070 

2311 

2552 

2792 

3031 

3270 

4.2 

1.4 

3508 

3746 

3984 

4220 

4456 

4692 

4927 

5161 

5395 

5629 

4.3 

1.4 

5862 

6094 

6326 

6557 

6787 

7018 

7247 

7476 

7705 

7933 

4.4 

1.4 

8160 

8387 

8614 

8840 

9065 

9290 

9515 

9739 

9962 *0185 

( Continued) 
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TABLE 3 ( Continued ) 

Natural Logarithms (In x = log p x) 


X 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

4.5 

1.5 

0408 

0630 

0851 

1072 

1293 

1513 

1732 

1951 

2170 

2388 

4.6 

1.5 

2606 

2823 

3039 

3256 

3471 

3687 

3902 

4116 

4330 

4543 

4.7 

1.5 

4756 

4969 

5181 

5393 

5604 

5814 

6025 

6235 

6444 

6653 

4.8 

1.5 

6862 

7070 

7277 

7485 

7691 

7898 

8104 

8309 

8515 

8719 

4.9 

1.5 

8924 

9127 

9331 

9534 

9737 

9939 

*0141 

*0342 

*0543 

*0744 

5.0 

1.6 

0944 

1144 

1343 

1542 

1741 

1939 

2137 

2334 

2531 

2728 

5.1 

1.6 

2924 

3120 

3315 

3511 

3705 

3900 

4094 

4287 

4481 

4673 

5.2 

1.6 

4866 

5058 

5250 

5441 

5632 

5823 

6013 

6203 

6393 

6582 

5.3 

1.6 

6771 

6959 

7147 

7335 

7523 

7710 

7896 

8083 

8269 

8455 

5.4 

1.6 

8640 

8825 

9010 

9194 

9378 

9562 

9745 

9928 

*0111 

*0293 

5.5 

1.7 

0475 

0656 

0838 

1019 

1199 

1380 

1560 

1740 

1919 

2098 

5.6 

1.7 

2277 

2455 

2633 

2811 

2988 

3166 

3342 

3519 

3695 

3871 

5.7 

1.7 

4047 

4222 

4397 

4572 

4746 

4920 

5094 

5267 

5440 

5613 

5.8 

1.7 

5786 

5958 

6130 

6302 

6473 

6644 

6815 

6985 

7156 

7326 

5.9 

1.7 

7495 

7665 

7843 

8002 

8171 

8339 

8507 

8675 

8842 

9009 

6.0 

1.7 

9176 

9342 

9509 

9675 

9840 

*0006 

*0171 

*0336 

*0500 

*0665 

6.1 

1.8 

0829 

0993 

1156 

1319 

1482 

1645 

1808 

1970 

2132 

2294 

6.2 

1.8 

2455 

2616 

2777 

2938 

3098 

3258 

3418 

3578 

3737 

3896 

6.3 

1.8 

4055 

4214 

4372 

4530 

4688 

4845 

5003 

5160 

5317 

5473 

6.4 

1.8 

5630 

5786 

5942 

6097 

6253 

6408 

6563 

6718 

6872 

7026 

6.5 

1.8 

7180 

7334 

7487 

7641 

7794 

7947 

8099 

8251 

8403 

8555 

6.6 

1.8 

8707 

8858 

9010 

9160 

9311 

9462 

9612 

9762 

9912 

*0061 

6.7 

1.9 

0211 

0360 

0509 

0658 

0806 

0954 

1102 

1250 

1398 

1545 

6.8 

1.9 

1692 

1839 

1986 

2132 

2279 

2425 

2571 

2716 

2862 

3007 

6.9 

1.9 

3152 

3297 

3442 

3586 

3730 

3874 

4018 

4162 

4305 

4448 

7.0 

1.9 

4591 

4734 

4876 

5019 

5161 

5303 

5445 

5586 

5727 

5869 

7.1 

1.9 

6009 

6150 

6291 

6431 

6571 

6711 

6851 

6991 

7130 

7269 

7.2 

1.9 

7408 

7547 

7685 

7824 

7962 

8100 

8238 

8376 

8513 

8650 

7.3 

1.9 

8787 

8924 

9061 

9198 

9334 

9470 

9606 

9742 

9877 

*0013 

7.4 

2.0 

0148 

0283 

0418 

0553 

0687 

0821 

0956 

1089 

1223 

1357 

7.5 

2.0 

1490 

1624 

1757 

1890 

2022 

2155 

2287 

2419 

2551 

2683 

7.6 

2.0 

2815 

2946 

3078 

3209 

3340 

3471 

3601 

3732 

3862 

3992 

7.7 

2.0 

4122 

4252 

4381 

4511 

4640 

4769 

4898 

5027 

5156 

5284 

7.8 

2.0 

5412 

5540 

5668 

5796 

5924 

6051 

6179 

6306 

6433 

6560 

7.9 

2.0 

6686 

6813 

6939 

7065 

7191 

7317 

7443 

7568 

7694 

7819 

8.0 

2.0 

7944 

8069 

8194 

8318 

8443 

8567 

8691 

8815 

8939 

9063 

8.1 

2.0 

9186 

9310 

9433 

9556 

9679 

9802 

9924 

*0047 

*0169 

*0291 

8.2 

2.1 

0413 

0535 

0657 

0779 

0900 

1021 

1142 

1263 

1384 

1505 

8.3 

2.1 

1626 

1746 

1866 

1986 

2106 

2226 

2346 

2465 

2585 

2704 

8.4 

2.1 

2823 

2942 

3061 

3180 

3298 

3417 

3535 

3653 

3771 

3889 

8.5 

2.1 

4007 

4124 

4242 

4359 

4476 

4593 

4710 

4827 

4943 

5060 
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TABLE 3 ( Continued ) 

Natural Logarithms (In x = log p x) 


X 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

8.6 

2.1 

5176 

5292 

5409 

5524 

5640 

5756 

5871 

5987 

6102 

6217 

8.7 

2.1 

6332 

6447 

6562 

6677 

6791 

6905 

7020 

7134 

7248 

7361 

oo 

o 6 

2.1 

7475 

7589 

7702 

7816 

7929 

8042 

8155 

8267 

8380 

8493 

00 

LO 

2.1 

8605 

8717 

8830 

8942 

9054 

9165 

9277 

9389 

9500 

9611 

9.0 

2.1 

9722 

9834 

9944 

*0055 

*0166 

*0276 

*0387 

*0497 

*0607 

*0717 

9.1 

2.2 

0827 

0937 

1047 

1157 

1266 

1375 

1485 

1594 

1703 

1812 

9.2 

2.2 

1920 

2029 

2138 

2246 

2354 

2462 

2570 

2678 

2786 

2894 

9.3 

2.2 

3001 

3109 

3216 

3324 

3431 

3538 

3645 

3751 

3858 

3965 

9.4 

2.2 

4071 

4177 

4284 

4390 

4496 

4601 

4707 

4813 

4918 

5024 

9.5 

2.2 

5129 

5234 

5339 

5444 

5549 

5654 

5759 

5863 

5968 

6072 

9.6 

2.2 

6176 

6280 

6384 

6488 

6592 

6696 

6799 

6903 

7006 

7109 

9.7 

2.2 

7213 

7316 

7419 

7521 

7624 

7727 

7829 

7932 

8034 

8136 

9.8 

2.2 

8238 

8340 

8442 

8544 

8646 

8747 

8849 

8950 

9051 

9152 

9.9 

2.2 

9253 

9354 

9455 

9556 

9657 

9757 

9858 

9958 

*0058 

*0158 

10.0 

2.3 

0259 

0358 

0458 

0558 

0658 

0757 

0857 

0956 

1055 

1154 

X 


0 

1 

2 

3 

4 

5 

6 

7 

8 

9 


Note: The * indicates that the first two digits are those at the beginning of the next row. 
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TABLE 4 

Common Logarithms (log 10 x ) 


X 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

0000 

0043 

0086 

0128 

0170 

0212 

0253 

0294 

0334 

0374 

11 

0414 

0453 

0492 

0531 

0569 

0607 

0645 

0682 

0719 

0755 

12 

0792 

0828 

0864 

0899 

0934 

0969 

1004 

1038 

1072 

1106 

13 

1139 

1173 

1206 

1239 

1271 

1303 

1335 

1367 

1399 

1430 

14 

1461 

1492 

1523 

1553 

1584 

1614 

1644 

1673 

1703 

1732 

15 

1761 

1790 

1818 

1847 

1875 

1903 

1931 

1959 

1987 

2014 

16 

2041 

2068 

2095 

2122 

2148 

2175 

2201 

2227 

2253 

2279 

17 

2304 

2330 

2355 

2380 

2405 

2430 

2455 

2480 

2504 

2529 

18 

2553 

2577 

2601 

2625 

2648 

2672 

2695 

2718 

2742 

2765 

19 

2788 

2810 

2833 

2856 

2878 

2900 

2923 

2945 

2967 

2989 

20 

3010 

3032 

3054 

3075 

3096 

3118 

3139 

3160 

3181 

3201 

21 

3222 

3243 

3263 

3284 

3304 

3324 

3345 

3365 

3385 

3404 

22 

3424 

3444 

3464 

3483 

3502 

3522 

3541 

3560 

3579 

3598 

23 

3617 

3636 

3655 

3674 

3692 

3711 

3729 

3747 

3766 

3784 

24 

3802 

3820 

3838 

3856 

3874 

3892 

3909 

3927 

3945 

3962 

25 

3979 

3997 

4014 

4031 

4048 

4065 

4082 

4099 

4116 

4133 

26 

4150 

4166 

4183 

4200 

4216 

4232 

4249 

4265 

4281 

4298 

27 

4314 

4330 

4346 

4362 

4378 

4393 

4409 

4425 

4440 

4456 

28 

4472 

4487 

4502 

4518 

4533 

4548 

4564 

4579 

4594 

4609 

29 

4624 

4639 

4654 

4669 

4683 

4698 

4713 

4728 

4742 

4757 

30 

4771 

4786 

4800 

4814 

4829 

4843 

4857 

4871 

4886 

4900 

31 

4914 

4928 

4942 

4955 

4969 

4983 

4997 

5011 

5024 

5038 

32 

5051 

5065 

5079 

5092 

5105 

5119 

5132 

5145 

5159 

5172 

33 

5185 

5198 

5211 

5224 

5237 

5250 

5263 

5276 

5289 

5302 

34 

5315 

5328 

5340 

5353 

5366 

5378 

5391 

5403 

5416 

5428 

35 

5441 

5453 

5465 

5478 

5490 

5502 

5514 

5527 

5539 

5551 

36 

5563 

5575 

5587 

5599 

5611 

5623 

5635 

5647 

5658 

5670 

37 

5682 

5694 

5705 

5717 

5729 

5740 

5752 

5763 

5775 

5786 

38 

5798 

5809 

5821 

5832 

5843 

5855 

5866 

5877 

5888 

5899 

39 

5911 

5922 

5933 

5944 

5955 

5966 

5977 

5988 

5999 

6010 

40 

6021 

6031 

6042 

6053 

6064 

6075 

6085 

6096 

6107 

6117 

41 

6128 

6138 

6149 

6160 

6170 

6180 

6191 

6201 

6212 

6222 

42 

6232 

6243 

6253 

6263 

6274 

6284 

6294 

6304 

6314 

6325 

43 

6335 

6345 

6355 

6365 

6375 

6385 

6395 

6405 

6415 

6425 

44 

6435 

6444 

6454 

6464 

6474 

6484 

6493 

6503 

6513 

6522 

45 

6532 

6542 

6551 

6561 

6571 

6580 

6590 

6599 

6609 

6618 

46 

6628 

6637 

6646 

6656 

6665 

6675 

6684 

6693 

6702 

6712 

47 

6721 

6730 

6739 

6749 

6758 

6767 

6776 

6785 

6794 

6803 

48 

6812 

6821 

6830 

6839 

6848 

6857 

6866 

6875 

6884 

6893 

49 

6902 

6911 

6920 

6928 

6937 

6946 

6955 

6964 

6972 

6981 
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TABLE 4 ( Continued) 

Common Logarithms (log 10 x ) 


X 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

50 

6990 

6998 

7007 

7016 

7024 

7033 

7042 

7050 

7059 

7067 

51 

7076 

7084 

7093 

7101 

7110 

7118 

7126 

7135 

7143 

7152 

52 

7160 

7168 

7177 

7185 

7193 

7202 

7210 

7218 

7226 

7235 

53 

7243 

7251 

7259 

7267 

7275 

7284 

7292 

7300 

7308 

7316 

54 

7324 

7332 

7340 

7348 

7356 

7364 

7372 

7380 

7388 

7396 

55 

7404 

7412 

7419 

7427 

7435 

7443 

7451 

7459 

7466 

7474 

56 

7482 

7490 

7497 

7505 

7513 

7520 

7528 

7536 

7543 

7551 

57 

7559 

7566 

7574 

7582 

7589 

7597 

7604 

7612 

7619 

7627 

58 

7634 

7642 

7649 

7657 

7664 

7672 

7679 

7686 

7694 

7701 

59 

7709 

7716 

7723 

7731 

7738 

7745 

7752 

7760 

7767 

7774 

60 

7782 

7789 

7796 

7803 

7810 

7818 

7825 

7832 

7839 

7846 

61 

7853 

7860 

7868 

7875 

7882 

7889 

7896 

7903 

7910 

7917 

62 

7924 

7931 

7938 

7945 

7952 

7959 

7966 

7973 

7980 

7987 

63 

7993 

8000 

8007 

8014 

8021 

8028 

8035 

8041 

8048 

8055 

64 

8062 

8069 

8075 

8082 

8089 

8096 

8102 

8109 

8116 

8122 

65 

8129 

8136 

8142 

8149 

8156 

8162 

8169 

8176 

8182 

8189 

66 

8195 

8202 

8209 

8215 

8222 

8228 

8235 

8241 

8248 

8254 

67 

8261 

8267 

8274 

8280 

8287 

8293 

8299 

8306 

8312 

8319 

68 

8325 

8331 

8338 

8344 

8351 

8357 

8363 

8370 

8376 

8382 

69 

8388 

8395 

8401 

8407 

8414 

8420 

8426 

8432 

8439 

8445 

70 

8451 

8457 

8463 

8470 

8476 

8482 

8488 

8494 

8500 

8506 

71 

8513 

8519 

8525 

8531 

8537 

8543 

8549 

8555 

8561 

8567 

72 

8573 

8579 

8585 

8591 

8597 

8603 

8609 

8615 

8621 

8627 

73 

8633 

8639 

8645 

8651 

8657 

8663 

8669 

8675 

8681 

8686 

74 

8692 

8698 

8704 

8710 

8716 

8722 

8727 

8733 

8739 

8745 

75 

8751 

8756 

8762 

8768 

8774 

8779 

8785 

8791 

8797 

8802 

76 

8808 

8814 

8820 

8825 

8831 

8837 

8842 

8848 

8854 

8859 

77 

8865 

8871 

8876 

8882 

8887 

8893 

8899 

8904 

8910 

8915 

78 

8921 

8927 

8932 

8938 

8943 

8949 

8954 

8960 

8965 

8971 

79 

8976 

8982 

8987 

8993 

8998 

9004 

9009 

9015 

9020 

9025 

80 

9031 

9036 

9042 

9047 

9053 

9058 

9063 

9069 

9074 

9079 

81 

9085 

9090 

9096 

9101 

9106 

9112 

9117 

9122 

9128 

9133 

82 

9138 

9143 

9149 

9154 

9159 

9165 

9170 

9175 

9180 

9186 

83 

9191 

9196 

9201 

9206 

9212 

9217 

9222 

9227 

9232 

9238 

84 

9243 

9248 

9253 

9258 

9263 

9269 

9274 

9279 

9284 

9289 

85 

9294 

9299 

9304 

9309 

9315 

9320 

9325 

9330 

9335 

9340 

86 

9345 

9350 

9355 

9360 

9365 

9370 

9375 

9380 

9385 

9390 

87 

9395 

9400 

9405 

9410 

9415 

9420 

9425 

9430 

9435 

9440 

88 

9445 

9450 

9455 

9460 

9465 

9469 

9474 

9479 

9484 

9489 

89 

9494 

9499 

9504 

9509 

9513 

9518 

9523 

9528 

9533 

9538 
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TABLE 4 ( Continued) 

Common Logarithms (log 10 x ) 


X 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

90 

9542 

9547 

9552 

9557 

9562 

9566 

9571 

9576 

9581 

9586 

91 

9590 

9595 

9600 

9605 

9609 

9614 

9619 

9624 

9628 

9633 

92 

9638 

9643 

9647 

9652 

9657 

9661 

9666 

9671 

9675 

9680 

93 

9685 

9689 

9694 

9699 

9703 

9708 

9713 

9717 

9722 

9727 

94 

9731 

9736 

9741 

9745 

9750 

9754 

9759 

9763 

9768 

9773 

95 

9777 

9782 

9786 

9791 

9795 

9800 

9805 

9809 

9814 

9818 

96 

9823 

9827 

9832 

9836 

9841 

9845 

9850 

9854 

9859 

9863 

97 

9868 

9872 

9877 

9881 

9886 

9890 

9894 

9899 

9903 

9908 

98 

9912 

9917 

9921 

9926 

9930 

9934 

9939 

9943 

9948 

9952 

99 

9956 

9961 

9965 

9969 

9974 

9978 

9983 

9987 

9991 

9996 


Note: Decimal points are omitted in this table; the entries 


0 12 
10 0000 0043 0086 

mean that log 10 (1.00 = 0.0000, log 10 (1.01) = 0.0043, and log 10 (1.02) = 0.0086 (to 
four-decimal-place accuracy). 
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TABLE 5 

Powers and Roots 


X 

X 2 


X 3 


1 

1 

1.000 

1 

1.000 

2 

4 

1.414 

8 

1.260 

3 

9 

1.732 

27 

1.442 

4 

16 

2.000 

64 

1.587 

5 

25 

2.236 

125 

1.710 

6 

36 

2.449 

216 

1.817 

7 

49 

2.646 

343 

1.913 

8 

64 

2.828 

512 

2.000 

9 

81 

3.000 

729 

2.080 

10 

100 

3.162 

1,000 

2.154 

11 

121 

3.317 

1,331 

2.224 

12 

144 

3.464 

1,728 

2.289 

13 

169 

3.606 

2,197 

2.351 

14 

196 

3.742 

2,744 

2.410 

15 

225 

3.873 

3,375 

2.466 

16 

256 

4.000 

4,096 

2.520 

17 

289 

4.123 

4,913 

2.571 

18 

324 

4.243 

5,832 

2.621 

19 

361 

4.359 

6,859 

2.668 

20 

400 

4.472 

8,000 

2.714 

21 

441 

4.583 

9,261 

2.759 

22 

484 

4.690 

10,648 

2.802 

23 

529 

4.796 

12,167 

2.844 

24 

576 

4.899 

13,824 

2.884 

25 

625 

5.000 

15,625 

2.924 

26 

676 

5.099 

17,576 

2.962 

27 

729 

5.196 

19,683 

3.000 

28 

784 

5.292 

21,952 

3.037 

29 

841 

5.385 

24,389 

3.072 

30 

900 

5.477 

27,000 

3.107 

31 

961 

5.568 

29,791 

3.141 

32 

1,024 

5.657 

32,768 

3.175 

33 

1,089 

5.745 

35,937 

3.208 

34 

1,156 

5.831 

39,304 

3.240 

35 

1,225 

5.916 

42,875 

3.271 

36 

1,296 

6.000 

46,656 

3.302 

37 

1,369 

6.083 

50,653 

3.332 

38 

1,444 

6.164 

54,872 

3.362 

39 

1,521 

6.245 

59,319 

3.391 

40 

1,600 

6.325 

64,000 

3.420 


(Continued) 
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TABLE 5 ( Continued) 

Powers and Roots 


X 

X 2 


X 3 

%[x 

41 

1,681 

6.403 

68,921 

3.448 

42 

1,764 

6.481 

74,088 

3.476 

43 

1,849 

6.557 

79,507 

3.503 

44 

1,936 

6.633 

85,184 

3.530 

45 

2,025 

6.708 

91,125 

3.557 

46 

2,116 

6.782 

97,336 

3.583 

47 

2,209 

6.856 

103,823 

3.609 

48 

2,304 

6.928 

110,592 

3.634 

49 

2,401 

7.000 

117,649 

3.659 

50 

2,500 

7.071 

125,000 

3.684 

51 

2,601 

7.141 

132,651 

3.708 

52 

2,704 

7.211 

140,608 

3.733 

53 

2,809 

7.280 

148,877 

3.756 

54 

2,916 

7.348 

157,464 

3.780 

55 

3,025 

7.416 

166,375 

3.803 

56 

3,136 

7.483 

175,616 

3.826 

57 

3,249 

7.550 

185,193 

3.849 

58 

3,364 

7.616 

195,112 

3.871 

59 

3,481 

7.681 

205,379 

3.893 

60 

3,600 

7.746 

216,000 

3.915 

61 

3,721 

7.810 

226,981 

3.936 

62 

3,844 

7.874 

238,328 

3.958 

63 

3,969 

7.937 

250,047 

3.979 

64 

4,096 

8.000 

262,144 

4.000 

65 

4,225 

8.062 

274,625 

4.021 

66 

4,356 

8.124 

287,496 

4.041 

67 

4,489 

8.185 

300,763 

4.062 

68 

4,624 

8.246 

314,432 

4.082 

69 

4,761 

8.307 

328,509 

4.102 

70 

4,900 

8.367 

343,000 

4.121 

71 

5,041 

8.426 

357,911 

4.141 

72 

5,184 

8.485 

373,248 

4.160 

73 

5,329 

8.544 

389,017 

4.179 

74 

5,476 

8.602 

405,224 

4.198 

75 

5,625 

8.660 

421,875 

4.217 

76 

5,776 

8.718 

438,976 

4.236 

77 

5,929 

8.775 

456,533 

4.254 

78 

6,084 

8.832 

474,552 

4.273 

79 

6,241 

8.888 

493,039 

4.291 

80 

6,400 

8.944 

512,000 4.309 

(Continued) 
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TABLE 5 ( Continued) 

Powers and Roots 


X 

X 2 


X 3 


81 

6,561 

9.000 

531,441 

4.327 

82 

6,724 

9.055 

551,368 

4.344 

83 

6,889 

9.110 

571,787 

4.362 

84 

7,056 

9.165 

592,704 

4.380 

85 

7,225 

9.220 

614,125 

4.397 

86 

7,396 

9.274 

636,056 

4.414 

87 

7,569 

9.327 

658,503 

4.431 

88 

7,744 

9.381 

681,472 

4.448 

89 

7,921 

9.434 

704,969 

4.465 

90 

8,100 

9.487 

729,000 

4.481 

91 

8,281 

9.539 

753,571 

4.498 

92 

8,464 

9.592 

778,688 

4.514 

93 

8,649 

9.644 

804,357 

4.531 

94 

8,836 

9.695 

830,584 

4.547 

95 

9,025 

9.747 

857,375 

4.563 

96 

9,216 

9.798 

884,736 

4.579 

97 

9,409 

9.849 

912,673 

4.595 

98 

9,604 

9.899 

941,192 

4.610 

99 

9,801 

9.950 

970,299 

4.626 

100 

10,000 

10.000 

1 , 000,000 

4.642 
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TABLE 6 

Factorials 


n 


n\ 


0 

1.00000 

00000 

E00 

1 

1.00000 

00000 

E00 

2 

2.00000 

00000 

E00 

3 

6.00000 

00000 

E00 

4 

2.40000 

00000 

E01 

5 

1.20000 

00000 

E02 

6 

7.20000 

00000 

E02 

7 

5.04000 

00000 

E03 

8 

4.03200 

00000 

E04 

9 

3.62880 

00000 

E05 

10 

3.62880 

00000 

E06 

11 

3.99168 

00000 

E07 

12 

4.79001 

60000 

E08 

13 

6.22702 

08000 

E09 

14 

8.71782 

91200 

E10 

15 

1.30767 

43680 

E12 

16 

2.09227 

89888 

E13 

17 

3.55687 

42810 

E14 

18 

6.40237 

37057 

E15 

19 

1.21645 

10041 

E17 

20 

2.43290 

20082 

E18 

21 

5.10909 

42172 

E19 

22 

1.12400 

07278 

E21 

23 

2.58520 

16739 

E22 

24 

6.20448 

40173 

E23 

25 

1.55112 

10043 

E25 

26 

4.03291 

46113 

E26 

27 

1.08888 

69450 

E28 

28 

3.04888 

34461 

E29 

29 

8.84176 

19937 

E30 

30 

2.65252 

85981 

E32 

31 

8.22283 

86542 

E33 

32 

2.63130 

83693 

E35 

33 

8.68331 

76188 

E36 

34 

2.95232 

79904 

E38 

35 

1.03331 

47966 

E40 

36 

3.71993 

32679 

E41 

37 

1.37637 

53091 

E43 

38 

5.23022 

61747 

E44 

39 

2.03978 

82081 

E46 

(' Continued ) 
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TABLE 6 ( Continued ) 


Factorials 


n 


n\ 


40 

8.15915 

28325 

E47 

41 

3.34525 

26613 

E49 

42 

1.40500 

61178 

E51 

43 

6.04152 

63063 

E52 

44 

2.65827 

15748 

E54 

45 

1.19622 

22087 

E56 

46 

5.50262 

21598 

E57 

47 

2.58623 

24151 

E59 

48 

1.24139 

15593 

E61 

49 

6.08281 

86403 

E62 

50 

3.04140 

93202 

E64 


Note: Values are given in scientific notation 
with the exponent denoted by E; for 
example, 2.65252 85981 E32 denotes 
2.6525285981 x 10 32 . 





Answers 


Section 2 

2. (a) v=— e —x~+c; 

J 3 2 

(b) y = logx + c; 

(c) y = ^ 2 +c; 

(d) y = x sin -1 x + V 1-x 2 + c; 

(e) y = x - log(l+x) + c; 


( f ) V = \ l °g(l + ^ 2 ) + c; 


(g) y = ^!og 


x 2 -x + l 

(x+1) 2 


2 

+ —j= tan -1 

73 


2x-l 

“7T 


+ c; 


( h ) V = -(tan -1 x) 2 +c; 

(i) x = c(y - l)e- v ; 

(j) x _4 + y _4 = c; 

(k) sin y = cxe~ 

(l) y = Ce * 2 ; 

(m) x 3 + 3 cos y = c; 

(n) y = -log(csc x + cot x) + c; 

(o) y-c cos x; 

(p) y = c sec x; 

. . c-x 

<<!) w 

(r) y=f" 

3. (a) y-xe x - e x + 3; 

(b) y = sin 2 x+l; 

(c) y = x log x - x; 


( d ) y^^iog 


3x-3 
x + 1 


681 
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(e) y = g!°g 


4-x 2 

3x 2 


( f ) V = -iog[(^ + i) 2 (^ 2 + i) 3 ]- 


1 _! , 
—tan x + 1. 
2 


4. (a) 3e 2 y~2e 3x + 1; 

(b) y = x 2 + logx; 

(c) tan -1 x + ev-l; 

(d) 2 sin 3x cos 2y = 1; 

(e) 2y +1 = e*(sin x + cos x); 

(f) log x(y + l) =y - x + 1. 

8. m = 1,1/2, -2; y = + c 2 e x/2 + c 3 e~ 2x . 


Section 3 

1. (a) x 2 -y 2 ~c, 

(b) x 2 + 2y 2 = c 2 ; 

(c) r = c(l - cos 0); 

(d) y 2 --2x + c. 

2. (a) x 2 + 4y 2 = c 2 ; 

(b) x 2 + ny 2 =c 2 . 

The orthogonal trajectories are ellipses, and are more and more elon¬ 
gated in the x direction as n is taken to be larger and larger. 

3. r-2c cos 0. 

4. r = c/(l + cos 0). 

5. y 2 = 2xy ^ + y 2 ^^j ; the family is self-orthogonal in the sense that 

when a curve in the family intersects another curve in the family, it is 
orthogonal to it. 

6. (a) xy-c; 

(b) y 2 =±2x + c; 

(c) y=ce ±x ; 

(d) y 2 -cx; 

(e) x 2 + 2y 2 = c 2 ; 

(f) y 2 -±x 2 + c; 
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(g) 0 = 0 or r = 2c sin 0; 

(h) 0 = 0 O or r-ce ke . 

7. y-cx 2 . 

8. xy=ce kx . 

9. The intersections of the cylinders xy = c with the saddle surface z=y 2 - x 2 . 
10 . (a) (xy' - y) 2 -x 2 (x 2 - y 2 ); 

(b) (x 2 -y 2 - l)y' = 2xy; 

(c) (x-y) 2 (l + y' 2 ) = (x + yy') 2 ; 

(d) y+y' 2 =xy'; 

(e) (y - xy’) 2 = l + y' 2 . 


Section 4 

. . lOOlog 2 

2. (a) T =- ^ —years; 

r 

(b) about 6.93 percent. 

3. (a) A = D 


v * y 


(b) $5986; 

(c) $1866. 
W 


4. (a) + P- 


W 


(b) W 0 = fcP; 

/ \ rp 1 , W 

(c) 


(d) about 13.86 years. 

5. If x=x(t) is his wealth at time t, and t = 0 one year ago, then x = 20/(2 - f). 
Thus, in 6 months x - 40 million dollars, and at the end of 1 year (as t -* 2) 
x becomes infinite. 

6. At about 10.11 p.m. 

7. 3531. 


8. In the year a.d. 2076; 6.6 billion. 

9. (b) About 15.2 grams. 

x 0 x 1 , 1 


10. x = 


x 0 +(xi-x 0 )e 


sj; when *=j xi- 
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12. About 35.35 percent; about 3.125 percent. 

13. About 133 days. 

14. About 13.53 percent. 

16. If B=A, then 

kA 2 abt 

X —-’ 

kAabt +1 ' 

and if B<A, then 

AB( i- e - k{A - B)M ) 
X A— p e ^ k ( A -B)abt 


The first formula is the limit of the second formula as B 
should prove this by using THospital's rule. 

17 l + (A/x) = 1 + (A/x 1 ) ‘ ,h 
1 + (A/x 0 ) _l + (A/x 0 )_ 

18. x = - - -zg-. 

Xq + (1 — x 0 )e 

19. 40 log 2 = 27.72 minutes. 

20. 2 log 2 = 1.39 hours. 

21. No later than 36 minutes after the smoking starts. 

22. 40 feet. 


23. 


_9_ 

25 


81 T 

to and 

625 



25. ^ -1 hours, 
log 2 

26. 60°. 

27. 16°. 

28. At 6 a.m. 

29. (a) About 3330 years (1380 b.c.); 

(b) about 3850 years (1900 b.c.); 

(c) about 10,510 years; 

(d) about 7010 years. 


A; students 


Section 5 

l~g 1 - 
V c i + 



1 . v - 


; the terminal velocity is 
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2. 2 miles. 

3. 256 feet; when t = 4, t = 8. vl/2g ; when t = v 0 /g, 2 v 0 /g. 

7. ^R;^R. 

8. JgR, which is approximately 5 miles/second. 


Miscellaneous Problems for Chapter 1 

1. (V5 -1) hours before noon. 

2. r = (2 - f)/8; one more month. 

3. After 100 log 2 minutes. 

4. 100(^2-1) minutes. 

5. The intersections of the cylinders x = cy 4 with 4x * 1 2 + y 2 + 4z 2 = 36. 

_ 14R 5/2 , 

7. - - ,_ seconds. 

I5r 2 j2g 

8. The shape of the surface obtained by revolving y = cx 4 about the y-axis. 


9. 25/2. 



log(4 + Vl5) seconds. 
= pT; T = T 0 e^ 9 . 


14. r = roe" 


x/lL 


15. The President. 

16. Go 2 miles toward the origin and then move outward along one of the 
spirals r = e ±6 ^. 

17. r - ~^= e~ e ; total distance = a. 

V2 


Section 7 

1. (a) y 2 =x 2 +cx 4 ; 

(b) y = cx 2 (x + y); 

(c) y = xtancx 3 4 5 * 7 8 ; 

(d) cos (y/x) + log cx = 0; 
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(e) y=x log (log cx 7 )-, 

(f) x 1 - Ixy ~y 2 =c ; 

(g) y = cx 3 -x; 



(i) y = cx 2 /( 1 - cx); 

(j) y 3 -x 3 log cx 3 . 

2. x 2 + y 2 -cy. 

3. (a) x + y = tan (x + c); 

(b) tan (x - y +1) = x + c. 

4. (b) z = dx + ey. 



(b) y - x-5 log (x + y - l) + c; 

(c) log:[(y - xf + (x - 1) 2 ] + 2 tan^ 1 ^ ^j = c; 

(d) (x + 2y)(x -2 y - 4) 3 = c; 

(e) (2 x - y + 3) 4 = c(x +1) 3 . 

6. (a) n = -1/2, x = ce i;r ; 

(b) « = 3/4, 2 + 5xy 2 = cx 5/2 ; 

(c) « = -1, x= cye xy . 

11. (a) r-ce B (in polar coordinates); 

(b) r-ce~ e -, 

(c) x 2 - y 2 -c. 


Section 8 

1. xy + log y 2 -c. 

2. Not exact. 

3. 4xy - x 4 + y 4 = c. 

4. Not exact. 

5. xy + sin xy = c. 

6. Not exact. 

7. xey + sin x cos y - c. 

„ x x 

8. cos — = c or — = c. 

y y 
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9. Not exact. 

10. x 2 y 3 + y sinx = c. 

n. iog!±a-2*=c. 

1 -xy 


12. x 2 y 4 + x sin y = c. 


13. log 


^ 1 + xy ^ 


+ x = c. 


v 1 “ xy y 

14. 3x 2 + 2(x 2 - y) 3/2 = c. 


15. Not exact. 

2 

16. xe- v + esc y cot x = c . 

17. x - y 2 cos 2 x=c. 

18. x 2 + y 2 = c 2 . 

19. x 3 (l + log y) -y 2 -c. 

20. -y + y 2 - x 2 = c(x + y) or x + y 2 

21. x 2 y 2 (4y 2 -x 2 ) = c. 

22. (a) n = 3, x 2 y 2 + 2x 3 y = c; 

(b) n = 1, x 2 + e 2xy =c. 


x 2 -c{x + y). 


Section 9 

l 

2. (a) y = —^,x 2 -y 2 = cy 3 ; 

y 

(b) g = — ,2xy-logx 2 -y 2 =c; 

1 

(c) g = ——3,3x 2 y 4 = l + cx 2 y 2 ; 

(*y) 

(d) |i = sin y, e* sin y + y 2 = c; 

(e) |i = xe*, x 2 e r sin y = c; 

1 , 

( f ) 1 * = 7 ^ 2' 1 + i !/ = c * 2 /; 

(*y) 

(g) [j — x 2 ,4x 3 y 2 + x 4 — c; 

(h) y = y, xy 2 - e% 2 - 2y + 2) = c; 

(i) |x= —,xlogy-x 2 + y = c; 

y 

(j) g = e x v, e xi/ (x+y)=c, 

(k) g = e x ~ /2 , e x ,2 {y i + x 2 - 2) = c. 
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3. When ( dM/dy - dN/dx)/(N - M) is a function g(z) of z = x + y. 

. , . x 1 

4. (a) — = — + y + c; 

y y 

(b) log- = iy 3 + c; 

y ^ 

(c) tan -1 — = - — x 4 + c; 

y_ 4 

(d) log Jx 2 + y 2 = tan -1 — + c; 

y 

(e) tan -1 — = 3x + c; 

x 

(f) y=x/(x + c); 

(g) y = 2x 2 + 3 + cx; 

(h) 2 Jxy=y + c; 

(i) —-— log x + y = c', 

xy 

(j) 3x + x 3 y 4 + ci/ = 0; 

(k) x (y 5 + cy) = 4; 

(l) y = x/(x 2 + c); 

(m) xy + x cos x = sin x + c. 

5. x 2 cos (y/x 2 ) + y sin (y/x 2 ) = cx 3 . 

6. r = c/( 1 - cos 0), a parabola. 


Section 10 

2. (a) y = x 4 + cx 3 ; 

(b) y = tan' 1 e x + ce - *; 

(c) y = (l + x 2 )' 1 log (sin x) + c(l + x 2 ) -1 ; 

(d) y = x 2 e - - r + x 2 - 2x + 2 + ce~ x ', 

(e) y=x 2 csc x+c esc x; 

(f) y = -x 3 + cx 2 ; 

(g) xy szn x-sin x - x cos x + c; 

(h) y = 3x V' + ce x ; 

(i) y = (x 3 + c)/log x; 

(j) y = x 2 (l +ce 1/x ). 
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l 

3. (a) , = -x 4 + cx * 1 2 ; 

y 

(b) y 3 = 3 sin x + 9x _1 cos x - 18x -2 sin x - 18x -3 cos x + cx -3 ; 

(c) 1 + x\j log x = cxy. 

4. (a) xi/ 2 = ey + c; 

(b) x-yev + cy; 

(c) 1 =x 2 (y + cey); 

(d) 2x/(y) 3 =f (y) 2 + c. 

5. (a) x = y - 2 + ce~y; 

(b) 3x + y 2 =Cyjy. 

7. logy = 2x 2 + cx. 

8. y = tanx - secx. 

9. x = (10-f)-^(10-f) 4 ;0<f <10. 

10. (a) 45 pounds; 

(b) after^-(3- -J3) = 16.9minutes. 

11. (a) If k 2 * k lr y = -— 1 -e _fc!t );andif k 2 = k lr y = k]X 0 te 

k 2 -k x 

(b) About 66 days. 


Section 11 

1. (a) y 2 ~c 1 x + c 2 ; 

(b) x 2 + (y-c 2 ) 2 = c 2 ; 

(c) y=c 1 e kx +c 2 e~ kx ; 

(d) y = -—x 2 -CiX-c 2 log(x - Ci) + c 2 ; 

(e) 2 ^Cty -1 = ±CiX + c 2 ; 

(f) y = c 2 e cix ) 

(g) y = x 2 + Cj logx + c 2 . 

2. (a) y = l or 3y+x 3 =3; 

(b) 2y - 3 = 8ye 3x/2 ; 

(c) y=-log(2e-*-l). 

3. (a) y = -log [cos (x + q)] + c 2 ; 

(b) y = log(c 1 e*+e-*) + c 2 . 


-kit 
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4. T = 2n^R/g = 89 minutes. 

5. s = s 0 cos jg/Aa t, period = An^ja/g. 


Section 12 


2. T 0 i/" = w(s)y]l + ( xj') 2 + L(x). 

3. A parabola. 

5. y = c(e ax + e~ ax ), where the bottom of the curtain is on the x-axis and the 

lowest point of the cord is on the y-axis. 

6. A horizontal straight line or a catenary. 


8- (a) y = 2 


1 

1 + jfc 


\+k 


1 

lHc 


1 -k 


+ 


ck 

1 -k * 2 3 * * 6 ' 


so the distance the rabbit runs is ck/{ 1 - k 2 ). 


(b) y = 


2 2 
X -C 


2c 


■dog 


and the dog can get closer than c/2 + e for any e>0 but not as close 
as c/2. 


f „t+i 


9. y = 




c 

„/t-l 


If (/c> 1), then y as i -» 0 and the boat will never land. If 

a = b(k = 1), then y -> - c/2 as x ^ 0 and the boat will land at (0, -c/2). If 
a < h (k< 1), then y ^ 0 as x 0 and the boat will land at the origin. 


Section 13 


2. (a) 1 = 
(b) I = 


R-kL 

E 0 


- e + In - 


Eo 


,-Rt/L. 


Vr 2 +lV 


R-kL 
sin (rat-a) +1 I 0 + 


E 0 L(a 

R 2 + LV 


,-Rt/L 


where tan a = Leo/ R. 


4. (a) Q = L 0 C(1 - e~ t/RC ); 
(b) case A, RC = 1, 

Q = E 0 Cte~ t - 
case 2, RC * 1, 
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(c) 



[RCrosinrot + cosrof-e t/RC 


5. Q = Q 0 cos( t/jLC),I = (-Qo/VLC )sin (t/VLC). 


Miscellaneous Problems for Chapter 2 



3. 3 tan 1 ^ + ^ = log[(y +1) 2 + (x -1) 2 ] + c. 
x — 1 



5. 3y = 2x 2 + cx 2 y 3 . 


7. y 2 = c 2 e 2x + c 1 . 

8. xy = x sin x + cos x + c. 

9. y - x log y + cx. 

10. ye x - x 2 y 3 = c. 

11. c 1 tan* 1 c 1 x = y + c 2 . 

12. y-x 2 + cx. 

13. y - x sin x + 2 cos x - 2x _1 sin x + cx -1 . 

14. (3x + 2 y) + log (3x + 2 y) 2 + x = c. 

15. x cos (x + y) - c. 

1 , 

16. y = — (logx) +Cilogx + c 2 . 

17. ye”'+ sin x = c. 

18. (x - y) log (x - y) = c - y. 

19. y = xe' 1 + ce~ x . 

20. x 2 y 2 - 2x 3 y - x 4 = c. 

21. y- x 4 (l + x 2 )- 1 + c(l + x 2 )- 1 . 

22. e x sin y + cos xy = c. 



24. 2xe v + x 2 + y 2 - 2x 2 y = c. 
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2xe x e~v + y 2 -c. 
y 4 - x 4 log x 4 = cx 4 . 

3 y cos 3 x = 3 sin x - sin 3 x + c. 
y = x(cx 2 - 1)/(cx 2 +1). 
l + e ( */ y ) = cy. 


25. 

26. 

27. 

28. 

29. 

30. (5y + 4) 2 - 4(5y + 4)(5x + 2) - (5x + 2) 2 = c. 

31. x 3 log y-c. 

32. 


2 , 5x 

y log-3cosy = c. 


33. 

34. 

35. 

36. 

37. 

38. 

39. 

40. 

41. 

42. 

43. 

44. 

45. 

46. 

47. 

48. 

49. 

50. 

51. 

52. 

53. 


x + 3 
x = c(x + y) 2 . 

logx —— = c. 
xy 

1 2 Cl 


y = -x -ylog(x +Ci) + c 2 . 
x 3 y - xy 3 = c. 

4x 2 y = (x 2 +1) 3 + c(x 2 +1). 

3(y - l) 2 +4(y - l)(x +1) + 3(x +1) 2 =c. 


x 3 ev - x 2 + cos y-c. 
x-cyy - log Cjy. 
xy(x + y) 2 =c. 
y-x tan (log cx). 


— = 1 + logx + cx. 

y 

[cos y][log (5x +15)] + log y = c. 

cyy 2 = CjX + log (CjX - 1) + c 2 . 

xye x - e x -c. 

y-x 2 /(c - x). 

y 3 = 3(c 2 - x - cpj). 

x = esc y[log (sec y) + c]. 

When t = 25. 
ce 2* s i 5 = y^f 
y + x 

rlrr 

(a) “^ + [s(t) + I]z = -Iz 2 . 
at 


(b) y = l + z, where 


— = e 
z 




((l/2)at 2 +It) | e -({l/2)at 2 +It)^ t + Q 
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55. Burnout velocity = b log 


1 + 


m 2 

nil 


gm 2 _ 
/ 
a 


i . .. gmj bm 2 bm 1 , m x 

burnout height = - + -+-log-. 

2a a a nil + 

59. (a) If the constant acceleration due to the constant gravitational field is 
denoted by A, then 


v = c 


f -y _ g -2 At/c \ 

1 + p-Ut/c 

v ITC 


Section 14 

1. (a) y = Ci + c 2 x 2 ; 

(b) n-1, y-Ci + c 2 x 2 + x 3 . 

2. y-Ci + c 2 log x. 

3. (a) y - c x e~ x + c 2 e 2x ; 

(b) y = Cie~ x + c 2 e 2x - 2x + 1. 

4. (a) y=l/(2x); 

(b) y = -3x; 

(c) y = -|sinx. 

5. (a) y-CiX + c 2 +e x ; 

(b) y-Ci + c 2 e 2x - 2x; 

1 

(c) y = Cie* +c 2 e x - — sinx; 

(d) y-CiX + c 2 e x ; 

(e) y=c l +c 2 e~ 2x +2e x . 

6. (a) x 2 y" - 2xy' + 2y = 0; 

(b) y"-Fy = 0; 

(c) y" + fc 2 y = 0; 

(d) y'' + 2y' = 0; 

(e) (1 - x cot x)y" -xy' + y- 0; 

(f) y" - 2y' +y = 0; 

(g) y" + 2y'-3y=0; 

(h) x 2 y" + xx/-y = 0. 
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Section 15 

2. y-x + lx 2 . 

3. y=- 3e x +2e 2x . 

5. i/j = x 2 , y 2 =x~\ y = 3x 2 - 2x _1 . 

6. (a) y- 6e x + 2e~ 2x ) 

(b) y = 0; 

(c) y = 4e~ 2x -3e~ 3x ; 

(d) y = e~ 2 -e~ x . 

7. (a) y - a constant or y = log (x + c ,) + c 2 . 
11. (a )u = e 2> ,if + \Q-±F-±P 2 \v = 0. 


(b) y = {c\X+c 2 )e * 2n 


Section 16 

2. (a) y 2 — —cos x, y~c 1 sin x + c 2 cos x; 



6. y = c l x~ l/2 sin x + c 2 x _1/2 cos x. 

7. (a) y-cpx + c 2 e x ; 

(b) y = c x x + c 2 x~ 2 ) 

(c) y-c r x+c 2 xe x . 

8. y = CjX + c 2 x|x _2 e^6)*rfx. 

9. y=c/+c 2 xV. 

10. (a) y! = e x ,y 2 = e r ]x n e~ x dx. 

(b) y-c x e x + c 2 (x +1 ), y = c x e x + c 2 (x 2 + 2x + 2), 
y = +c 2 (x 3 + 3x 2 +6x + 6). 

11. y = qe*+c 2 e t Je[ _2 * + W9*] dx. 
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Section 17 

1. (a) y=c l e 2x +c 2 e~ 3x ; 

(b) y=c l e~ x +c 2 xe~ x ; 

(c) y = Ci cos 2V2x + c 2 sin 2 V2x; 

(d) y = e 1 (ci cos \[3x + c 2 sin V3x) ; 

(e) y=c 1 e 2x +c 2 xe 2x -, 

(f) y=c 1 e 5x +c 2 e‘ tx ; 

(g) y = e - * 72 ^Ci cos ^ V5x + c 2 sin ^ V5x j; 

(h) y=c 1 e 3r/2 +c 2 xe 3 * /2 ; 

(i) y = c 1 + c 2 e~ I ; 

(j) y = e 3x (Cj cos 4x + c 2 sin 4x); 

(k) y=c l e~ 5x/2 +c 2 pce~ 5x/2 ; 

(l) y = e~ x (c\ cos V2x + c 2 sin V2x); 

(m) y = c 1 e 2jr + c 2 e _2;c ; 

(n) y = e x ^c x cos ^ V3x + c 2 sin ^ V3xj; 

(o) y=c 1 e x/2 +c 7 e~ Xm , 

(p) y = c 1 e t/4 +c 2 xe jr/4 ; 

(q) y = e -2 *^ cos x + c 2 sin x); 

(r) y=c 1 e x +c 2 e~ 5x . 

2. (a) y-e 3x ~ 1 -, 

(b) y=e x + 2e 5x ; 

(c) y=5xe 3x ; 

(d) y = e _2l '(cos x + 2 sin x); 

(e) y = e (- 2 +^_ 2e (- 2 -^b. 


9 x _, 1 




5. (a) y = x _1 [c 1 cos (log x 3 ) + c 2 sin (log x 3 )]; 

(b) y = qx -2 + c 2 x~ 2 log x; 

(c) y = c 1 x 3 +c 2 x -4 ; 

(d) y = c 1 x 3/2 + c 2 x _1/2 ; 

(e) y = cyx 2 + c 2 x 2 log x; 

(f) y = c 1 x 2 +c 2 x -3 ; 


(g) y = * 


- 1/2 


Ci cos [ — VlT log x ] + c 2 sin [ — VlT log 


V2 
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(h) y = CiX^~ + c 2 x 
(i) y-c l x i + c 2 x~ i . 

7. (a) y = e~ xZ/4 (c 1 cos-^'j3x 2 + c 2 sin^\/3x 2 \ 
(b) not possible. 


Section 18 


1. (a) y =Cie 2x + c 2 e 5x + —e 4x ; 

(b) y-c 1 sin lx + c 2 cos lx + sin x; 

(c) y = c 1 e~ 5x + c 2 xe~ 5x +7x 2 e~ 5x ; 

(d) y = e x (c l cos lx + c 2 sin 2x) + 2 + 4x + 5x 2 ; 

(e) y = c x e 3x + c 2 e~ lx - Axe~ 2x \ 

(f) y = c x e x + c 2 e 2x +2 sin 2x + 3 cos 2x; 

(g) y=c x sin x + c 2 cos x + x sin x; 

(h) y-c x + c 2 e 2x + 2x - 3x 2 ; 

(i) y = c 1 e T +c 2 xe’ : +3x 2 e- 1: ; 

(j) y = e x {c x cos x + c 2 sin x) - — xe* cos xi 


(k) y = c x +c 2 e _1 + 2x 5 - 10x 4 +40x 3 - 120x 2 +242x. 

2. y = Ci sinkx + c 2 cos kx + _^ n ^ unless b = k, in which case y = c x sin kx + 


c 2 cos kx - 


xcos kx 
2k 


3. 


(a) y-c x sin 2x + c 2 cos 2x + x sin 2x + 2 cos x - 1 - x + 2x 2 ; 

1 1 

(b) y = Cisin3x + c 2 cos3x-—xcos3x + —sinx-2e -21 +3x 3 -2x. 


Section 19 

1. y p = 2x + 4. 

2 - = 

1 

3. (a) y p = - — cos2xlog(sec2x + tan2x); 

(b) y P = ^ x 2 e~ x log x - ^ x 2 e~*; 

(c) y p =-e~*(8x 2 + 4x + l); 








Answers 


697 


1 l 

(d) y p =—xe sin2x + — e * 1 cos2xlog(cos2x); 

( e ) ^ = 

(f) y p = e x log (1 + e~ x ) - e x + e 2x log (1 + e~ x ). 

4. (a) y p - x sin x + cos x log (cos x); 

(b) y v = cos x log (esc x + cot x) -2; 

1 1 

(c) y p = — cosxlog(secx + tanx)- — sinxlog(cscx + cotx); 

1 , 

(d) y p = — (x sinx + xcosx-sinx); 

(e) y p = -cos x log (sec x + tan x); 

(f) y p = x cos x - sin x - sin x log (cos x); 

(g) y p = -sin x log (esc x + cot x) - cos x log (sec x + tan x). 

5. (b) y p (x) = —f f(t)sink(x-t)dt. 

k Jo 

1 1 

6. (a) y = CiX + c 2 (x 2 3 + 1) + — x 4 5 6 - — x 2 ; 

(b) y = + c 2 x -1 -x-l-^x 2 ; 

(c) y = c 1 x + c 2 e J: + x 2 + l; 

l 

(d) y = c x e* +c 2 (x + l) + — e 2l '(x-l); 

(e) y = cix + c 2 x 2 - xe” T - (x 2 + x)J* - 

elementary function. 


-dx, where this integral is not an 


Section 20 

1. The frequency is — J— -„ when — --— is positive, which is 

4 3 2n V M 2M 2 M 2M 2 

k c 2 

more restrictive than the condition that- T > 0. 

M 4M 2 

3. 2n^j2r/3g seconds. 

4. About 574 pounds. 

5. The round trip time is 2n^jR/g seconds, where R is the radius of the 
earth; this is approximately 90 minutes. The greatest speed is approxi¬ 
mately 0.074L miles/minute or 4.43L miles/hour. 

1 1 

6. x = — cos4f + — sin4f-fcos4f. 

2 4 
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Section 21 

1. (a) 2V2 years. 

(b) 3>/3 years. 

(c) 125 years. 

2. (a) About 0.39 astronomical units or 36,000,000 miles, 
(b) About 29.5 years. 


Section 22 

1. y-c 1 +c 2 e x +c 3 e 2x . 

2. y = c r e x + e x (c 2 cos x + c 3 sin x). 

3. y = Cie* + e ~ x/2 1 c 2 cos — V3x + c 3 sin — V3x 

4. y = Cie - * + e x/ 2 1 c 2 cos — *j3x + c 3 sin — >/3x 

5. y =(c 4 + c 2 x + c 3 x 2 )e _ *. 

6. y = (c 4 + c 2 x + c 3 x 2 + c 4 x 3 )e _ *. 

7. y = c 3 e x + c 2 e~ x + c 3 cos x + c 4 sin x. 

8. y = c 4 cos x + c 2 sin x + c 3 cos 2x + c 4 sin 2x. 

9. y = (c 4 + c 2 x)e ax +(c 3 + c 4 x)e _ “. 

10. y = (c 4 + c 2 x)e _I + c 3 cos x + c 4 sin x. 

11. y = (c l + c 2 x)e~ x + c 3 cos x + c 4 sin x. 

12 . y = (c 4 + c 2 x)e* + e _2l '(c 3 cos x + c 4 sin x). 

13. y = c 1 e sc +c 2 e 2 *+c 3 e 3 *. 

14. y = c x e 2x +(c 2 + c 3 x + c 4 x 2 )e -1 '. 

15. y=(c 4 + c 2 x)e 2x +(c 3 + c 4 x)e _2:t +c 5 e fa . 


d 4 Xi 

17. — 

df 4 


k\ +^3 k 2 +^3 




wz 2 


d 2 Xi 


dt 2 


'(k 1 + k 3 \ 

r?c 2 +fc 3 N 

^3 

{ m i J 

l m 2 , 

miM 2 


18. Xi = Ci cos —t + c 2 sin —t + c 3 cos —f+ c 4 sin 


3k 1 

77Z ’ 271 


X! = 0. 

Y 

m 
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19. y = CjX 3 + c 2 x 2 + c 3 x + c 4 + sin x + x 4 . 

20. y-c r + c 2 e x + c 3 e 2x + 5x + 7e 3x . 

9 1 

21 . y = —e x — e~ x -x. 

2 2 

22. (a) y = q + c 2 x + c 3 x _1 ; 

(b) y = qx + c 2 x 2 +c 3 x _1 ; 

(c) y = CjX + c 2 cos (log x) + c 3 sin (log x). 


Section 23 



2. y = — (9x 2 -2Ax + 26)e 2x . 


27 



4 1 2 , 

4. y = —xe. 


2 



7. y-x 3 - 6x - 5. 

8. y = 2x 3 + 9x 2 + 40x + 73. 

9. y-x 4 - 48x 2 + 384. 

10. y = - — x 5 - — x 3 -2x. 

J 60 3 

11. y = —x 10 — 151,200x 4 . 

12. y-x 4 + 4x 3 + 24x 2 + 69x +117. 

13. y-x 4 - 12x 2 + 24. 

14. y = -2x 3 - 5x 2 - lOx - 10. 

15. y = — x 4 - —x 3 + —x 2 -21x + 21. 

4 3 2 

16. y--e 2x {x 3 + 6x). 


1 
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18. y = 2e 2x (x 2 +4x + 6) + -^e 2x . 

19. y--2x 2 . 

20. y-x 3 - 1. 

21 . y = -lx 1 . 

1 

22 - y = -*(iogx-i). 
i o 

23. y = —x 2 + 2x + l. 
j 2 

1 

24. y =—(4x 3 -6x 2 + 6x-3). 

’ 48 


25. (a) y = |^—x +cqx +c 2 x + c 3 le ; 

(b) y = (2x 3 + c 1 x 2 + c 2 x + c 3 )e _A: ; 

(c) y = (-sin x + cpx + c 2 )e 2x . 


Section 24 

3. u" + 


Ax 2 


u = 0 . 


Section 25 

3. If/(x) > 0 and k > 0, then every solution of the equation y" + \f(x) + k\y = 0 
has an infinite number of positive zeros. 


Section 26 

6. V ” nx"~\ 
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Section 27 


1 . (a) y = a 0 


2 x 4 x 6 x 8 

1 + x + — + — + — + ••• 
2! 3! 4! 


= «o£ ; 


(b) y = fl 0 - (flo ~ 1)* + ^ x 2 - ^ x 3 + ■ 


2 ! 


3! 


— 1 + (a 0 — 1) 


1 x 2 x 3 

1 X I ~ - 4 • 

v 2! 3! , 


— 1 + («o _ l)e ' • 


2 . (a) y-ayx, no discrepancies; 

(b) y = 0, y = ce~ 1/x , the latter being analytic at x = 0 only when c = 0. 

„ . j 1 ■ 3• ■ • (2n-1) x 2n+1 

3. sin x = x+ x 

5 - y = 


Z °° i • 3 • ■ 
n =1 2-4-- 


2-4---(2«) 2n + l 


x 2 x 3 x 4 

2T3! + 4!" 


Z' 2 3 4 

l-x +-+— 

2! 3! 4! 


+ x —1 = e x + x — 1. 


Section 28 


1 . y = a 0 | 1 + x 2 ——x 4 + — x 6 -—x 8 + - 
J 1 3 5 7 


+ fljX 


= a 0 (1 + x tan 1 x) + a x x. 

x 2 x 4 x 6 

2 . (a) y 1 (x) = l- —+ —--— 

* 2 2-4 2-4-6 


H-, 


3 S 7 

, V X X X 

l/ 2 (x) = x-+-+ • 

J 3 3-5 3-5-7 


3. Q-n+2 — 


(n + l)a„ + i-fl„_i 
(n + l)(n + 2) 


, v , v x 3 x 4 x 5 

(a.) i/i(x) — 1 H- 1 -h • 

\ / j ± \ / 23 234 2345 


4x 5 


. , . x 2 x 3 x 4 

(b) y 2 (x) = x-+-+- 

v/c/x/ 2 2-3 2 - 3-4 2 - 3 - 4-5 


+ •••. 
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. , . p-n 

4. (c) a n+2 = --—^-— a n , 


(n+l)(n + 2) 
iu(x) = a 0 1 r ~ * 1 2 


1-j^+ rvP 2 ) x 4 . 
2! 4! 


+ 


3! 


-l) x 3 , (p-l)(p-3) x5 


5. (b) y(x) = a 0 


1 + 


Z co 

n=l 2 • 


5! 

(-l)"x 3 " 


+ U\ 


x + 


= 1 2-5-8---(3n-l)3"«! 
(-l)"x 3 " +1 


(-1)' 

Z—tn =1 4 - 7 - 1 0 -f 


(c) y(x) = a 0 


1 + 


i4-7-10---(3?i + l)3"«! 

x 3n 


~\~ Cl\ 


<=i2-5-8---(3k-1)3"«! 

x 3n+1 

X Z^„=i4.7.io---(3« + 1)3"«! 

Ma)y,W = l -^+*Ez?ME±V x <-., 

, (p-l)(p-3)(p + l)(p + 3) ^5 
5! 


Section 29 

1. (a) x = 0 irregular, x = 1 regular; 

(b) x = 0 and x = 1 regular, x = -1 irregular; 

(c) x = 0 irregular; 

1 

(d) x = 0 and x = - — regular. 

2. (a) ordinary point; 

(b) ordinary point; 

(c) regular singular point; 

(d) regular singular point; 

(e) irregular singular point. 

3. (a) m(m - 1) - 2m + 2 = 0, m, = 2, m 2 =1; 

5 1 1 

(b) m(m-l)-— m + — = 0, mi = 2, m 2 = — ■ 
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4. (a) yi(x) = x 


_ v l/2 


X X 

1-+- 

3! 5! 


= sinVx, 


y 2 (x) = 1 - — + : -= cos yfx ; 

J 2! 4! 


Z °° X 

-- 

»=o 1 • 3 • 5 • • • 
y 2 (x) = x~ 1/2 ^ 


(2n + l) 


—— = x~ 1/2 e x/2 : 


n= o 2"n! 
21 


(c) y 1 (x) = x / j^l--x + —x 
y 2 (x) = 1 - 3x + 2x * 1 2 3 + • • •; 

(d) y 1 (x) = xj^l + |x + ^x 2 + - 


y 2 (x) = x 2 ^1-x- —x +••• 
6- (b) y 2 (x) = -xe 17 *. 


1 ..2 


Section 30 


1. y = x 2 (l -4x + 4x 2 +- ■ ■). 

2. y = c 1 x 1/2 e 1 ' + c 2 x 1/2 e T log x. 

r 2 r 4 

3. (a) i/i = 1 — -— + --= x ^inx, 

J 3! 5! 


y 2 =x~ 


1 x x 

1 - — + —- 

2! 4! 


= x ^osx; 


/IX 2 1 1 1 , 1 3 

(b) yi = x 1 + — x + —x"-x + 


20 


60 


-if, 1 1 2 1 4 

J l 2 2 8 J 


(c) yj = x 2 


1 x 4 * x 8 

1-+- 


v 


3! 5! 


= smx , 


y 


x 4 X s 

1 / 2=1 -+- 

3 2! 4! 


4. y = x 


f 1 4 

' X X 

1 - 3 + - 


2 2 ! 2 2 ! 3 ! 


• = cosx 

\ 

y 
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5. \ji = x 


1/2 


y = x 


4/2 


1 - — + — -• 
3! 5! 

x * 2 * x 4 

1 -+-.» 

2! 4! 


- 1/2 • 

= x ' sm x. 


= X 1/2 COS X. 


Section 31 


2. (a) y = CiF| 2,-1, — ,x | + c 2 x 7 F 


3 1 


= Ci | 1 - ^ x | + c 2 x 1/2 F 


2 2 2 
3 3 1 


,x 


2 2 


,x ; 


= Ci 


III 

2 2 

1 


(b) y = CiF| -,1,-,-x | + c 2 (-x) F| 1,-,-,-x 


3 3 


1 + X 


+ c 2 


(-*) 


1/2 


1 + x 


> i x+n fx+iV / 2 r f 5 5 3 x+n 




+ c 2 


3-x 


-9/5 


444 3-x 
1 5' 5' 5' 5 


4. (a) y = c 1 F(p,l,p,x) + c 2 x 1_ PF(l,2 - p,2 - p,x); 
1 


(b) y = Ci 


1 — X 


+ c 2 


^ x 1 ^ ^ 

1 — X 




5. y = Cl F|l,-l,-^,l- e * | + c 2 (l-e*) 3/2 F| |,I |. 


Section 32 

1. (a) A regular singular point with exponents p +1 and -p. 
(b) An irregular singular point. 
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Section 33 


1. 

2 . 

3. 


3n 


i; 


00 (-1)" +1 cos (2n - l)x - sin (2ft - l)x + sin 2(2ft - l)x 


2 ft -1 


1 1 v' ” (-1)" +1 cos (2ft - l)x + sin (2ft - l)x + sin 2(2 n - l)x 


- + - 

4 71 

1_2 
71 71 


s; 


2ft-1 


Z 00 cos 2 nx 1 

—=—- + — smx- 

2 /I 


4. —cosx + 

2 71 


4ft -1 2 

4 ftsin2ftx 

2 /I ■M 2 


4« -1 


5. (a) Tt; 

(b) sinx; 

(c) cos x; 

(d) tt + sin x + cos x. 

Notice that any finite trigonometric series is automatically the Fourier 
series of its sum. 

, , . 4 a( . sin3x sin5x 

6. (a) — smx +-+-h- 

71 v 3 5 

,, N 4( . sin3x sin5x 

(b) — smx +-+-+ •• 

7i v 3 5 

.. . sin3x sin5x 

(c) smxH- h-+ •••; 


(d) 

1 

6 | 

f . sin3x 

sin 5x 'i 

— + 

- 



2 

71 1 

l 3 

5 ) 

(e) 

3 

21 

f . sin 3x 

sin 5x ^ 

— + 


smx +- 



7. After forming the suggested series, continue by subtracting from the 
series in Problem 1, then dividing by Tt. 


Section 34 


2 . 

3 . 


f(x) = 


/(*) = 


71 

4 


7t 

4 


2 00 cos(2ft -l)x "ST™, 1 ,„+isinftx 

7t (2ft -l) 2 ft 

2 cos(2ft-l)x | 0 ^ x sin(2n-l)x 
~7t^i (2ft-l) 2 +i3 Zji 2n-1 



sin2ftx 

2ft 
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In each case, y 
sinhn 


5. /(*) = ■ 


1 + 2 


(2«-l) 2 I 
(- 1 )" 


z 


1 77+1 


cost7x-2 


z: 


H) n » 

« 2 + i 


sin«i 


Section 35 


1. Even, odd, neither, odd, even, even, neither, odd. 


= — (this concrete sum, familiar to us from elementary 


4. 1-- + - 
3 5 

calculus, provides strong emphasis for the very remarkable nature of 
the sine series we are considering: as x varies continuously between 0 
and it, each term of the series changes in value, but these changes are 
so delicately interrelated that the sum of all these variable quantities is 
constantly equal to — —astounding!);^. 

4 4 

s.Vr cos “ 

71 K 


, . 2 4 

6. smx;- 

n n 


z; 


4n -1 
cos 2 nx 


,0 < X < 71. 


3 cos(2t 7 — l)x 
« (277 -l) 2 


477 -1 

4 x—^ 01 

7. /(x)= 4 y 

71 '^—'1 

o / \ 0 V' ,CO , , V! sinnx 

8. (a) 7i - x = 71 + 2 > (-1) -; 

t—i\ n 

4 v - '” cos(277 - l)x 


(b) n-x = - + 

2 n 


z; 


(2/1-l) 2 


(c) 7i - x= 2 y 

2 v'~/ iv ,+isin77x 8 V'' 30 sin (2?7 - l)x 


3 Sm77X 

'1 77 


10. (b) x 2 = 27 t y; ( -l) 

1 V -r 2 

71 -^1 

1 ^ 3< 

71 


15. sin" x = 


cos" x = 


Z oo 

1 

_1 

277 — 1 2?7 + 1 277 -3 

2 1 _1 

277 -1 277 +1 277 -3 


, 0 < X < 71. 


(2?7 -If 
sin (2?7 - l)x, 0 < x < 7i; 
sin (277 - l)x, 0 < x < 7i. 
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Section 36 


12 x—^ 00 1 KX 

1. /(*) = — > —sin(2n-l)—. 

7 Ti-^i 2n-l 2 

2. (a) / W -l + i r y* C ° S<2,1 ~ * 1 2 ) " ; 

■ /W 2 7i 2 ^i (2n-l) 2 


ji 

8 


(b) f( X )= i-4-V 

71 

„ ,, . 1 * COS 2?771X 

5./W-1 + -X,' 

71 n 
OS 7tJ 
7 V—1 a 

7 -/<*>-?!, 


——, cos {In -1)—. 
i (2;i — l) 2 2 


Hr cos (2n-l)H 

2n-l 2 


6. COS TtX = COS TlX. 

2 ^ ” cos 2(2n - 1 )tix 

(2 m - 1) 2 


Section 38 


4. &l=-A=0,&3=^A=0,fc 5 = ^ 

71 371 371 

5. frj = 2, fc 2 =-l, fc 3 = |. 


Section 40 


1. (a) = 4m 2 , y n (x) - sin 277x; 

77 2 1 

(b) y„(x) = sin —tix; 


(c) X„ = 77 2 7t 2 , y„(x) = sin 777tx; 

9 9 

777TX _ 

L 1 ” 1_ r ; 


/ 1\ Cl * *• » V / \ • < 

(d) X,„ = —2-,y„(x) = sin- 


2 2 

77 71 


t x - - -v . > nn(x + L) 

(e) *•» = y,.(*) = sm— 
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(f) K = 


2 2 
n n 


, . . nn(x-a) 

2 ,y„(x) = sin—--- 


(b-a) ' b-a 

8c co / 1N „+i sin(2n -l)xcos(2« -l)«i 
n 2 


5. (a) y(x,i )= 8 2 Y Yl)" 

71 


(2n-l) 2 

n * 1 ' 1 


(2n -1) 


(c) y(*,t) = -Y 

71 O 


. 7771 . 37771 

sm— + sm- 

4 4 


sin?7XCOS 77flf. 


Section 41 

Z °o _ 2 2. 

b n e sinnx + g(x), 

1 2 I” 1 

where g(x) = h?! + — (ie 2 -wf)x and b„ = — [/(x)-g(x)]sinnxdx. 
71 71 Jo 

Z oo 2 2. 

b n e~ n 11 f sinnx, 

2 f 71 

where b n = I f(x) sin nx dx. 

71 Jo 

5. —1 =0,-1 = 0;ie(x,t) =100. 

5xi =0 dx j x=n V ; 

1 % ^ oo 2 2. 

6 . w(x, t) = —a 0 + 2^ “ f cos nx, where 


tt n " 


2 f 71 

-I f(x)cosnxdx for 77 = 0,1,2,.... 

: Jo 

Z o° 2 f 71 

b n e ny sin nx, where b n = — /(x) 

1 71 Jo 


71 Jo 


sinnx dx. 


Section 42 


2 . (a) H7(r,0) = ---Y°°(-l) n 

71 71 


7’" COS 770 

4« 2 -1 ' 


1 1 

(b) 7c(r,0)=2| rsin0- —;' 2 sin20 + —;' 3 sin30-- 
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2 7i v 3 5 

IS720 




3. (a) (1 - x 2 )\i" - 2x\i' + p(p + l)|i = 0; 

(b) x 2 \ f + 3 xyL + (1 + x 2 - p 2 )(i = 0; 

(c) (1 - x 2 )\i" - 3 x\i' + (p 2 - l)p = 0; 

(d) |i" + 2x|i' + (2 + 2p)|i = 0; 

(e) (i" + x|i = 0; 

(f) x(i'' + (l+x)|i' + (l+p)p = 0. 



.2 


6. (b) Legendre's and Airy's. 
8. (a) [(1 -x 2 )y']' + p(p + l)y = 0; 



(d) [e _ * 2 y']' + 2pe~* 2 y = 0; 

(e) [y']' + xy=0; 

(f) \xe~ x y ']' + pe~ x y = 0. 

10. (b) / H) = 0, y 0 (x)=:l; X n = n 2 for n = 1,2,3, . . ., and the eigenfunctions corre¬ 


sponding to each of these X n are cos nx and sin nx. 
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Section 44 

2. (c) P 2 (x) = |(3x 2 -l), 

P 3 (x) = l(5x 3 -3x), 

P 4 (x) = —(35x 4 - 30x 2 + 3), 

8 

P 5 (x) = — (63x 5 -70x 3 + 15x). 
8 


Section 45 

4. (a) /(x) = jP 0 (x) + ^Pfx) + ^rP 2 (x) + ■■■; 

4 z 16 

(b) f(x) = l(e - e-')P 0 (x) + 3e _1 Pi(x) + l(5c - 35^ 1 )P 2 (x) + • • •. 


Section 46 

7. y = x~ c [cj p (ax b ) + ^“'’(ax 6 )] if p is not an integer; 
y = x~ c [cj p (ax b ) + c 2 Y p (ax b )] in all cases. 


Section 47 


3- J 2 (x) = —7i(x)-/ 0 (x); 
x 


/3(x)=|4-i|W*)--/o(*); 


48 8 


/4(X) = I#-X l /l( x ) -|§- 1 l/° ( x ) - 
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Section 48 


3. L[sin 2 flx] = 


p p 2 + 4 a 2 


and L[cos 2 ax] = 


4. (a) 


/ -v 

of these transforms is the transform of 1 (=1 Ip). 
10 


1+ P 

p p 2 + 4a y 


; the sum 


... 5! 

(b) „6 + „2 


(C) 


p 6 p 2 + 4' 
2 5 


p-3 p +25 


(d) -^- + 


+ 4 p + 1 


, v 6! 

(e) -y ■ 


5. (a) 5x 3 ; 

(b) 2e- 3 *; 

(c) 2x 2 + 3 sin 2x; 

(d) 1 - e~ x ; 

(e) x - sin x. 


Section 49 


2 ‘ (a) pe ap ' 


(b) 


P(e”-1Y 


e p -1 - p 

pV-i) ; 


(d) 


l + e -’ 1 ' 7 

p 2 +l ' 
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Section 50 




1 2 ! 


,3 ' 


(C) 


P~ 3 


2. (a) 2e~ 2x sin 3x; 

(b) 2e~ 3x x 3 ; 

(c) e~ x cos 2x + e~ x sin 2x. 

3. (a) y(x) = -e~ x +e 2x ; 

(b) y(x) = 3xe 2x -, 

(c) y(x) = 1 - er x cos x; 

(d) y(x ) = -5 + 6x - 3x 2 + x 3 + 5e~ x ; 

(e) y(x) = er x sin 2x + e~ x sin x. 

4. y(x) = y 0 e ax +(y' 0 -ay 0 )xe ax . 

5. 1 - e~ x . 

r ^ -2x • 4 -2x 1 -x 

6 . i/ = — e sin x + — e ' cos x — e . 

2 2 2 


Section 51 





3. (a) y(x) = cx 2 e x ; 
(b) y(x) - xe~ x . 
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5. (a) log-; 
a 

(b) tan -1 -. 

a 


8 . (a) 


p(l+e p ) 


Section 52 

2 . (a) y(x) = cos x; 

(b) y(x) = e 2 *; 

(c) y(x) = e- x (x -1) 2 ; 

(d) y(x) = -2 sin x + 4 sin 2x. 
4. y-cx. 


Section 53 


2 . (a) — (1-cosflt); 
a 


(b) -^ T {e at -e u ); 


Jl_ 

a-b 

1 


(c) - T (e a — 1—«#); 
a 


(d) 


fl 2 -b 2 


(asinbt -b sin at). 


m / \ 1 3f O - 

4- (a) y = -^ +-e 
6 6 


■ 3 f -e- 2 ‘; 


(b) y = - —t - —-— e“ 3( + —e 2t ; 

J 6 36 45 20 


(c) y = 2e*-h 3 -t 2 -2t-2. 
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5 . (a) y(f)= f f /(x)[e 

Jo 


-((-,) _ e -2( f -T)] dx; 


/-i \ 1 31 1 _/ 1 _2f 

(b) ——e +—e 
20 4 5 


7.A(t) = -|l-cos,/-t 


and 


M0 = 


1, 3 _ t 1 a 
—t — + e —e . 

2 4 4 


’ VMfc 


sin V 


8 . (a) J(0 = ^[l- e -“ /L ]; 
(b) I(t) = |V Rf/L ; 


(c) 1(0 = 


Jr 2 + lW 

where tan a = Loi/R. 


sin (rat-a) + „ e Rt/L 


R + lW 


Section 54 


1 . (a) ^ = z 
dx 


dz 2 

— = xy + x Z; 
dx 


(b) ^ = z 


= ze 


2 2 

= w-x z . 


dx 
dz 
dx 
div 
dx 

r, dX 

2. - = Z7 r 

dt 

dv x = f(t,x,y) 
dt m 

dy 
dt 

dv y _ g{t,x,y) 
dt m 


= v y 
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Section 55 

5. (b) jx = c x e il + c 2 e~ 2t 

[y = c 1 e -c 2 e ; 

(c) \x = 3e it + 2e~ 2t 
|y = 3e 4 ' - 2e~ 2t . 

6. (b) jx = 2 Cl e 4t + c 2 e~* 

jy = 3cje 4f - c 2 e~ f ; 

(c) jx = 2c 1 e 4 ' + c 2 e~‘ + 3f - 2 
jy = 3cie 4f - c 2 e~' -2t + 3. 

8. | x = Cie f + Cite* 

y = c 2 e'. 


9. (a) jx = c 1 e t 
y = c 2 e*. 


Section 56 

1. (a) jx = 2cie~* + c 2 e* 

[y = cie _f +c 2 e'; 

(b) jx = e 3t (2c 1 cos3t + 2c 2 sin3t) 

jy = e 3t [ci(cos 3t + 3 sin 3f) + c 2 (sin 3t - 3 cos 3f)]; 

(c) |x = -2cie 3 ' + c 2 (l + 2f)e 3f 
jy = c^e 3 ' - c 2 te 3t ; 

(d) jx = 3ci + c 2 e~ 21 

jy = 4c 1 +2c 2 e“ 2f ; 

(e) jx- c^e 2 * 

[y = c 2 e 3t ; 

(f) jx = Cie~ 3t + c 2 ( 1 - t)e~ 3t 
\y = -Cie~ 3t +c 2 te~ 3i ■, 
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(g) jx = 2c 1 e 10f + 3 c 2 e ii 
jy = C\e m -2 c 2 e 3t ; 


(h) Jx = e * 2 3 4 5 6 '(c 1 cos2f + c 2 sin2 1 ) 

[y = e 3f [ci(sin2f-cos2t)-c 2 (sin2t+ cos 2 1 )\- 


5. (b) fx = 3f+ 2 


y = 2f-l 


Section 57 



2 . The fox curve is concave up whenever the rabbit curve is rising. 


Section 58 

2. Put c = t 1 - t 2 and use uniqueness. 

3. They are the same except that the directions of all paths are reversed in 
passing from one to the other. 

4. (a) Every point is a critical point, and there are no paths. 

(b) Every point on the y-axis is a critical point, and the paths are hori¬ 
zontal half-lines directed out to the left and right from the y-axis. 

(c) There are no critical points, and the paths are straight lines with 
slope 2 directed up to the right. 

(d) The point (0,0) is the only critical point, and the paths are half-lines 
of all possible slopes directed in toward the origin. 

5. For equations (1) and (2), they are (0,0), (±ji,0), (±2ji,0), (±3ji, 0), ...; and for 
equations (3), (0,0) is the only critical point. 


6 . (a) (-2,0), (0,0), (1,0) 
(b) (2,2), (3,3) 
x = c-ie 1 

y = ci e‘ +e‘ +c 2 - 


7. 
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Section 59 

1. (a) (i) The critical points are the points on the x-axis; 

(ii) dy/dx = 2xy/(x 2 +1). 

(iii) y = c(x 2 + 1). 

(b) (i) (0,0). 

(ii) dy/dx = -x/y; 

(iii) x 2 + y 2 = c 2 . 

(c) (i) There are no critical points; 

(ii) dy/dx = cos x; 

(iii) y = sin x + c. 

(d) (i) The critical points are the points on the y-axis; 

(ii) dy/dx = -2xy 2 ; 

(iii) y = l/(x 2 + c) and y = 0. 

2. (a) (i) Jx = c,e‘ 

[y = c 2 e "'i 


(ii) dy/dx = -y/x; 

(iii) xy = c; 

(iv) unstable. 

(b) (i)Jx = Cl e- f 

|y = c 2 e~ 2t ; 


(ii) dy/dx = 2y/x; 

(iii) y = cx 2 ; 

(iv) asymptotically stable. 

(c) (i) Jx = 2ci cos 2t + 2c 2 sin 2t 

|y = -Ci sin 2 1 + c 2 cos 2 f; 


(ii) 


dy _ -x 
dx Ay' 


(iii) 


2 2 

x y i 

77 H—7 — 1/ 

4c c 2 


(iv) stable but no asymptotically stable. 
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Section 60 


1. (a) Unstable node; 

(b) Asymptotically stable spiral; 

(c) Unstable saddle point; 

(d) Stable but not asymptotically stable center; 

(e) Asymptotically stable node; 

(f) The critical point is not isolated; 

(g) Unstable spiral. 

3. (c) The critical point is (-3,2), the transformed system is 


dx 
dt 
dff 
. dt 


2x-2y 


llx -8 y, 


and the critical point is an asymptotically stable node. 

4. (a) m 2 + 2 bm + a 2 = 0; p = 2b, q = a 2 . 

(b) (i) A stable but not asymptotically stable center; the mass oscillates; 
the displacement x and velocity y=dx/dt are periodic functions 
of time. 

(ii) An asymptotically stable spiral; the mass executes damped 
oscillations; x and dx/dt —> 0 through smaller and smaller 
oscillations. 

(iii) An asymptotically stable node; the mass does not oscillate; x and 
dx/dt -*■ 0 without oscillating. 

(iv) The same as (iii). 

5. a 2 x 2 - 2apcy - hyy 2 = c. 


Section 61 

1. (a) Neither; 

(b) Positive definite; 

(c) Neither; 

(d) Negative definite. 
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Section 62 

2. dx J 2j2 y-y 3 

dx x 3 -1xy 2 

3. Put D = -pg = (flj + b^)(a 1 b 2 - ajb^) > 0. 

4. No conclusion can be drawn about the stability properties of the non¬ 
linear system (4) at (0,0) when the related linear system (3) has a center 
at (0,0). 

5. (a) Unstable spiral; 

(b) Asymptotically stable node. 

6 . The critical point (0,0) is unstable if |i>0 and asymptotically stable if 

y<0. 


Section 63 

1. If/(0) = 0 and x/(x)< 0 for x * 0, the critical point is an unstable saddle 
point. 

3. y 2 - x 2 + x 4 = 2 E; {--Jl/2, 0) is a center; (0,0) is a saddle point; and {-J2./2 ,0) 
is a center. 

4. When z = F(x) has a maximum, the critical point is a saddle point; when 
it has a minimum, the critical point is a center; and when it has a point 
of inflection, the critical point is a cusp. 


Section 64 

dr 


2 . (a) 


dt 

de 

dt 


= r(4-r 2 ) 
= -4; 


(c) 


_ _ 2cos4(f + t 0 ) 

Vl + ce -8 ' 

_ -2sin4(f + f 0 ) 

V ~ Vi+«r 8f 


fx = 2cos4f 
[y = -2sin4f. 
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4. (a) A periodic solution (Lienard's theorem); 

(b) No periodic solution (Theorem B); 

(c) No periodic solution (Theorem A); 

(d) No periodic solution (Theorem B); 

(e) A periodic solution (Lienard's theorem). 


Section 66 

1. (a) (x-c 2 ) 2 + y 2 = cf 

(b) y-c l sin (x - c 2 ). 

2 - V = ^{x 2 -x). 

4. (a) c x = r cos (0 - c 2 ); 
(b) Same as (a). 


Section 67 

3. (a) x = 


ad 


a * 1 2 + b 2 + c 2 ' V a 2 + b 2 + c 2 ' 
cd 


bd 


z = - 


a 2 + b 2 + c 2 ’ 

5. The catenary y + A. = c x cosh 


r x - c 2 ' 
v c i 


Section 68 


1. y = -= l + x + x 2 +---,|x|<l; 

1 — x 


1 

i/i(x) = \ + x,y 2 (x) = 1 + x + x 2 + —x 3 4 . 


. „3 . 2 4 1 5 1 1 1 1 JJ 


t/ 3 (x) = l + x + x +x +—x + —x + —x + — X . 
y 3 3 9 63 
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2. y = e x -1; 

x 4 

y 1 (x) = x 2 , y 2 (x) = x 2 + — , 

r 4 r 6 

y 3 (x) = x 2 -\ -+-—, 

7 2 2-3 

. . 2 * 4 X 6 X® 

1/ 4 (X) = X H-+-+-. 

^ v 7 2 2-3 2-3-4 

3. (a) y n {x) = ^ + ^ + - + ^^ + e x ^{e x -x-l) + e x ; 


(b) y„(x) = l + x + 2 

—> l + x + 2(e x - x-1); 


X 2 X 3 X 4 x" +1 

-1-1-1- • • • H- 

2! 3! 4! (« + l)! 


(c) y 1 (x) = (sinx-x) + l + x + 


2 ! 


y 2 (x)= - 
ya(x) = - 

f 

y 4 (x) = 


2 A 

1 * 

COS X -1 +- 

2 ! 


+ l + X + (X 2 )+|y, 


smx-x + - 


3! 


+1 + x + 


2 X 

x + — 


3 1 


7 


4! 


2 4 \ 

. X X 

cos x -1 +- 

2i 41 

v y 


+1 + x + 


7 3 4 \ 

2 X X 
X + — +- 

v 3 3-4 y 


5! 


Section 69 

6. (a) All points (x 0/ y 0 ) with y 0 * 0; 

(b) All points (x 0 , y 0 ) since/(x,y)= |y| satisfies a Lipschitz condition on 
every rectangle. 

7. All points (x 0/ y 0 ). 


Section 70 

1. f y = cos x 


z = -smx. 
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Cotangent, Euler's partial fractions 
expansion of, 318 
Coupled harmonic oscillators, 158 
Courant, R., 179, 334,416, 617 
Crelle, August L., 485-486 
Critically damped motion, 140 
Critically damped vibration, 137-138 
Critical points, 516 

asymptotically stable, 527, 

537-538, 552-555 
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borderline case, 526 
centers, 523-524,536-537 
focus, 524 
isolated, 527, 548 
node, 520-522,531-532,534-536 
of nonlinear systems, 547-555 
path approaches, 521 
path enters, 521 
physical interpretation, 516 
saddle points, 522-523,532 
simple, 548-552 
spirals, 524-526, 532-534 
stability for linear systems, 
529-539 
stable, 527 

two-dimensional vector field, 516 
unstable, 527 
vortex, 523 

Curvature, mean, 617 

Curve(s) 
integral, 8 

one-parameter family of, 8 
pursuit, 88 
stationary, 587 

Cycloid, 44, 47-48, 88, 474, 592 


d'Alembert, Jean le Rond, 355 
formula, 363 
principle, 355 

solution of wave equation, 351 
Damped vibrations, 138-141 
Damping force, 138, 558 
Damping, linear, 558 
Darwin, Sir G. H., 574-575 
Dating, radiocarbon, 25 
Davis, Philip ]., 354 
Day, W. D., 480 
Decay 

exponential, 23 
radioactive, 23 
Dedekind, Richard, 265 
Definite integral, 6 
Degrees of freedom, 611 
Delta function, Dirac, 457 
de Moivre's formula, 270 
Dependent variable, 85-86 
Descartes, Rene, 45 


Differential equation, 1 
complete, 109 
exact, 70 
linear, 81 
normal form, 191 
order, 3 
ordinary, 3 
ordinary point, 210 
partial, 3 
reduced, 109 
singular point, 210, 219 
irregular, 220 
regular, 220 

standard form, 191 (see also Equation) 
Differential, exact, 70 
Diophantus, 174 
Dirac, P. A. M., 457 
delta function, 457 
Directed curve, 516 
Dirichlet, P. G. L., 306-307 
conditions, 283 
kernel, 345 

problem, 372-378, 433 
for a circle, 373 
theorem, 304 
Discontinuity 

jump, 301-302, 349 
simple, 301-302 
Discretization error 
local, 651 
total, 651 
Distance, 281 

between two functions, 333 
mean, 153 
Doubling time, 22 
Douglas, J., 617 
Dunnington, G. Waldo, 265 
Dynamical problems, variable mass, 
103-105 

Dynamical system, conservative, 557 

E 

e, 19 

Eccentricity, 152-153 
Eddington, Sir Arthur, 490 
Eigenfunction expansion, 383 
Eigenfunctions, 259, 356, 358, 360-361, 
380-381,388-392 
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Eigenvalues, 259, 356, 359,380,388-392 
Einstein, A., 266, 285,433, 514, 575, 582 
on doubting the obvious, 433 
on future of mathematical 
physics, 514 
and Poincare, 514 
relativity impossible without 
Gauss, 266 

on Riemannian geometry, 285 
special theory of relativity, 104 
use of calculus of variations, 582 
variable mass and E = Me 2 , 103 
Electric circuits, 95-98 
Electromotive force (emf), 95-96 
Electrostatic dipole potential, 433-434 
Electrostatic potential, 428 
Elementary functions, 107,197 
Ellipse, 151 
Elliptic integral, 36 
first kind, 36 
second kind, 36 
Energy 

conservation of, 32, 559 
kinetic, 544 
potential, 544 
Equation(s) 

Abel's integral, 472 
adjoint, 386 
Airy's, 218, 387 
auxiliary, 123,156, 462 
of the system, 499 
Bernoulli's, 83 

Bessel's (see Bessel's equation) 
Chebyshev's, 218, 241,387,392 
complete, 109 

differential (see Differential equation) 
equidimensional, Euler's, 126,161, 
374, 386 

Euler's, for calculus of variations, 587 
exact, 69-73 
heat, 3, 366-371 
Hermite's, 218, 250, 387, 392 
homogeneous, 65-67,109 
hypergeometric 
confluent, 244 
Gauss's, 236-240 
generalized, 281 
indicial, 248, 280 
integral (see Integral equation) 


Lagrange's, 607 
Laguerre's, 245, 329, 334 
Laplace's, 3, 316, 365 
Legendre's 3,178, 387, 392 
Lienard's, 568-569 
linear differential, 95 
of motion, for undamped pendulum, 
34-37, 514 

nonhomogeneous, 109 
one-dimensional heat, 366 
one-dimensional wave, 351, 357 
Parseval's 340-341 
prey-predator, Volterra's, 508 
reduced, 109 

Riccati (see Riccati equation) 
Riemann's, 278-287 
Schrodinger wave, 259 
second order linear, 81 
self-adjoint, 384, 387 
separable, 6 
Sturm-Liouville, 391 
two-dimensional Laplace, 373 
van der Pol, 514, 517, 557, 572 
wave (see Wave equation) 

Equation of motion, 435-437 
Equidimensional equation, 

Euler's, 126, 374 
Equilibrium point, 527 
Equilibrium populations, 510-511 
Erdelyi, A., 198, 237, 457 
Error 

circular, of pendulum clocks, 35 
local discretization, 651 
total discretization, 651 
total relative, 647 
Escape velocity, 38 
Euclid's theorem, 174 
Euler, Leonhard, 170-179 
characteristic, 178 
circuit, 176 
constant, 173 
equation for calculus of 
variations, 582 

equidimensional equation, 126,161, 
374, 386 
formula(s) 

for complex numbers, 123 

for Fourier coefficients, 289,292,326 

for polyhedra, 177 
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hypergeometric function, 237 
identity for primes, 276 
infinite product for the sine, 318 
irrationality of e, 385 
and Lagrange, 597 
law of quadratic reciprocity, 263 
method, 646 
error, 650-652 

exact and numerical solutions, 
648-649 

geometric realization, 648 
improved, 652-657 
integral of differential 
equation, 646 
piecewise-linear curve, 

647-648 

system of equations, 662-663 
total relative error, 647 
zeroth order Taylor 
polynomial, 647 
minimal surfaces, 616 
partial fractions expansion of the 
cotangent, 318 
path, 176 

on sequence of primes, 277 
sums of series, 172, 377, 406 
theorem on homogeneous 
functions, 612 
vibrating membrane, 435 
Euler's differential equation 
admissible functions, 584-585 
chain rule, 586 
differential geometry, 590 
disturbed functions, 584 
elementary calculus, 585-586 
extremals, 587-588, 593 
fixed choice of function, 587 
geodesics, 590 

stationary function/curve, 587, 592 
stationary points, 587 
stationary values, 587, 592 
x and y missing from function, 588 
x missing from function, 588-589 
y missing from function, 588 
Ewing, G. M., 584 
Exact differential, 70 
Exact equations, 69-72 
Expansion, eigenfunction, 361, 383 
Expansion, Heaviside, 166 


Expansion theorem 
Bessel, 422,440 
Heaviside, 482 
Legendre, 404 

Existence and uniqueness theorems, 625 
Picard's theorem, 626-637 
second order linear equation, 

638-641 
Exponential 
decay, 23 

functions, 19,122, 669 
growth, 21 
order, 453-454,459 
shift rule, 168-169 
Exponents, 208 
Extremal, 584-588 


Factorials, 679-680 
Faires, J. D., 651 
Fall 

free, 32-33 
retarded, 33-34 
Fermat, Pierre de, 45 
last theorem, 46 
principle of least time, 42 
two squares theorem, 46 
Fermi, Enrico, 104 
First order reaction, 22 
Fischer, E., 342 

Focal property of parabolas, 79 
Focus, 524 

Fomin, S. V., 584, 633 
Force 

central, 148 
conservative, 610 
damping, 514 
gravitational, 149 
restoring, 558 
Forced vibrations, 142-144 
Ford, Henry, 146 
Fourier, J. B. J„ 173, 299-300 
coefficients, 289-299,327, 361 
cosine series, 314 
series, 292, 327,354 

arbitrary intervals, 319-321 
convergence, 301-306 
cosine, 314 
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even and odd functions, 310-315 
Fourier coefficients, 289-299 
mean convergence, 336-342 
orthogonal functions, 325-333 
sine, 314, 361 

Fourier-Bessel series, 422 
Fox-rabbit problem, 511 
Fredholm, L, 385, 508 
Freedom, degrees of, 611 
Free fall, 32-33 
Free vibration, 142 
Frequency, 138 
natural, 141 
normal, 161 
resonance, 144 
Frobenius, F. G., 224 
method of, 224 
series, 224, 232-233 
Function(s) 

admissible, 584 
Airy, 218 
algebraic, 197 
analytic, 204 

Bessel (see Bessel functions) 
bounded, 302 
Dirac delta, 457 
distance between two, 330 
elementary, 197 
even, 310 

exponential order, 453 
gamma, 410-413 
generating 

for Bessel functions, 407 
of Legendre polynomials, 398 
harmonic, 373 
Hermite, of order n, 261 
homogeneous, 66 

Euler's theorem on, 613 
hypergeometric, 237 
confluent, 244 
inner product of two, 329 
input, 475-476 
Legendre, 214 
Liapunov, 542-543 
negative definite, 542 
negative semidefinite, 542 
normalized, 326, 379 
norm of, 331 
null, 331 


odd, 315 

orthogonal, 331, 379 
sequence of, 325, 328 
output, 475-476 
periodic, 294 

piecewise continuous, 452 
piecewise smooth, 349 
positive definite, 542 
positive semidefinite, 542 
Riemann's zeta, 262 
Schrodinger wave, 259 
space, 330 

spherical Bessel, 420 
stationary, 587 
transcendental, 197-198 
unit impulse, 457 
unit step, 475 

Fundamental lemma, calculus of 
variations, 587 

Fundamental theorem of calculus, 6 

G 

8,2 

Galileo, 40,48,183 
Gamma function, 410-413 
Gauss, Carl F., 262-270 
and Abel, 485 
complex numbers and 
quaternions, 619 

hypergeometric equation, 236-240 
hypergeometric function, 237 
potential theory, 371 
prime number theorem, 277 
Riemannian geometry, 284 
Riemann's dissertation, 282 
Gauss, Helen W., 262 
Gay-Lussac, Joseph L., 484 
Gelfand, I. M., 584 
Gelfond, A. O., 385-386 
Generalized coordinates, 607 
Generalized hypergeometric 
equation, 281 

General solution, 9,101,109 
Generating function 
Bessel functions, 441 
Hermite polynomials, 253 
Legendre polynomials, 398 
Genus, 178 
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Geodesics, 590 
on cone, 594 
on cylinder, 594, 606 
in physics, 611 
on sphere, 594, 604 
Gibbs, J. W., 619 
Global properties of paths, 564 
Goldstine, Herman H., 644 
Gradient, 608 
Graph, 176 
Graph theory, 176 
Grassmann, H., 619 
Gravitational 
constant, 149 
force, 149-151 
potential, 428 

Gravitation, Newton's law of, 38,149, 
483, 489 
Gray, A., 440, 444 
Green, George, 267 
Green's theorem, 567 
Growth 

exponential, 21 
population, 21-22 

H 

Hadamard, J., 287 
Haldane, J. B. S., 3 
Half-life, 23 
Halley, Edmund, 181 
Halperin, I., 457 

Hamilton, William Rowan, 618-619 
Hamilton's integral, 608, 610 
Hamilton's principle, 582 

action/Hamilton's integral, 608, 610 
Euler's equations, 609 
kinetic energy, 608, 610 
Lagrange's equations 

degrees of freedom, 611-612 
generalized coordinates, 611-612 
law of conservation of energy, 
613-614 

Lagrangian function, 609 
moving particle force, 608 
Newton's second law of motion, 609 
potential energy, 608 
variational problems for double 
integrals, 614-618 


Hamming, R. W., 645 
Hanging chain, 88-94 
Hardy, G. H., 5,175, 385 
Harmonic functions, 373 
Harmonic oscillators, 258-261 
coupled, 155-160 
Harmonic vibrations, simple, 137 
Heat equation, 3, 366-371, 428-429 
one-dimensional, 351 
Heat, specific, 366 
Heaviside, Oliver, 162, 478 
expansion, 166 
expansion theorem, 482 
Heaviside's methods, 162 
Hegel, G. W., 264 
Hermite, Charles, 261 
equation, 218, 392, 486 
functions 

of order n, 256 
orthogonality, 256-258 
polynomials, 219 

generating function, 253 
harmonic oscillator, 258-261 
independent series solutions, 251 
orthogonality, 256-258 
Rodrigues' formula for, 255 
two-term recursion formula, 251 
series, 258 

Herschel, Sir William, 183 
Hersh, Reuben, 354 
Heun, Karl, 653 

Heun's method, see Improved Euler 
method 

Higher transcendental functions, 173 
Hilbert, D„ 267,385,415 
Hiltebeitel, A., 266 
Hobbes, Thomas, 183 
Homogeneous 

boundary conditions, 383 
equations, 65-67,109 

constant coefficients, 122-125 
general solution, 113-117, 

119-121 
function, 66 

Euler's theorem, 613 
linear systems, 491, 498-504 
Homogeneous of degree, 66 
Hooke, Robert, 181 
Humboldt, F. H. A. von, 484 
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Hurewicz, W., 549, 551, 567 
Hurley, James F., 31 
Huygens, Christiaan, 48 
Hyperbola, 151 

Hypergeometric equation, 394 
confluent, 244 
Gauss's, 278 
generalized, 281 
Hypergeometric function, 237 
confluent, 245 
Hypergeometric series, 237 

I 

Identity 

Euler's, for primes, 286 
Riemann's, 280 
Improper integral 

absolute convergence, 452 
comparison test, 452 
convergence, 449 
Improved Euler method, 652-657 
Impulse, 479 

Impulse function, unit, 457, 479 
Impulsive response, 479-480 
Indefinite integral, 5 
Independent variable, 86-87 
Index, 570 

Indicial equation, 225, 232 
Indicial response, 476 
Inequality 

Bessel's, 334,340-341 
isoperimetric, 601 
Minkowski, 332 
Schwarz, 332 
triangle, 333 

Infinity, point at, 242-244 
Initial condition, 9 
Initial value problems, 108-109, 355 
Inner product of two functions, 331 
Inner product of two vectors, 329 
Input function, 475 
Integral curves, 8,11 
Integral, elliptic 
first kind, 36 
second kind, 36 
Integral equation, 470 
Abel's, 472 

Integral formula, Bessel's, 443-445 


Integral, improper, convergence 
of, 410, 449 

Integral, Poisson's, 372-379 
Integral transformation, 448 
Integrating factors, 74-79 
Interest, continuously 

compounded, 20-21 

Interval 

closed, 108,194, 290,325, 389-390, 
491-493, 629 
of convergence, 201 
open, 108,195,389-390, 392 
Inverse Laplace transform, 460 
Inverse Laplace transformation, 460 
Inverse operators, 163-164 
Irregular singular point, 243 
Isolated critical point, 519 
Isoperimetric inequality, 601 
Isoperimetric problems 
definition, 585 
enclosed area, 585 
finite side conditions, 602-605 
integral side conditions, 597-601 
Lagrange multipliers, 586-597 
length of the curve, 585 

J 

Jacobi, C. G. J., 198, 269, 282,486,574 
on Abel, 486, 574 
and Gauss, 269 
Jaeger, J. C., 169 
Jeans, Sir James, 490 
Jump discontinuities, 301, 361 

K 

Kac, Mark, 481 
Kant, Immanuel, 183, 269 
Kellogg, O. D., 433 
Kepler, Johannes, 149,183 
Kepler's law 

first law, 149-151 
second law, 148-149 
third law, 153-154 
Kernel 

Dirichlet, 345 

of integral transformations, 448 
Kinetic energy, 33, 608 


732 


Index 


Kirchhoff, Gustav R., 97 
Kirchhoff's law, 97 
Klein, F., 264 
Kolmogorov, A. N., 633 
Konigsberg bridges, 173,176 
Kruskal, M. D., 643 
Kummer, Ernst, 307 
Kutta, M. W., 657 


Lagrange, Joseph L., 606-607 
equations, 607, 611-614 
multiplier, 596-599 
variation of parameters, 135 
Lagrangian, 609 
Laguerre, Edmond, 245 
equation, 245, 392 
polynomials, 245 
Lambert, Johann H., 30, 385 

continued fraction for tangent, 420 
law of absorption, 30 
Lanczos, C., 266 
Laplace, Pierre S., 483-484, 619 
equation, 3, 371, 427 
two-dimensional, 373 
transform, 448 

algebraic equation, 458 
change of variable, 454 
derivatives and integrals, 463-468 
exponential order, 453-454, 459 
general properties, 466 
general transformation, 448 
improper integral, 449 
integral transformations, 448 
inverse, 460 
linearity, 450, 457 
piecewise continuous function, 
452-454 

power series, 450-451 
rational function, 454 
transform pairs, 461 
transformation, 448 
inverse, 460 

Law 

of absorption, Lambert's, 30 
conservation of energy, 613 
of gravitation, Newton's, 38, 

149,483,489 


Kepler's 

first, 149-151 
second, 148-149 
third, 153-154 
of mass action, 29 
of motion, Newton's second, 1,137, 
148, 609 
Ohm's, 96 
parallelogram, 336 
of refraction, Snell's, 42 
Lawyers, 263 

Least action, principle of, 609 
Least squares approximation, 

405-406 

Least time, Lermat's principle of, 42, 619 
Lebedev, N. N., 404 
Lebesgue, Henri, 289, 342 
Legendre, Adrien M., 263, 394, 445, 
485-486 

equation, 3, 212, 392 
expansion theorem, 404 
functions, 214 
polynomials, 214 
applications, 393 
binomial formula, 397 
gamma function, 393 
generating function, 398 
hypergeometric equation, 394 
least squares approximation, 
405-406 

Legendre series, 402-405 
nth polynomial, 395 
orthogonality, 400-402 
power series solutions, 394 
Rodrigues' formula, 397 
sphere, steady-state temperatures, 
430-433 
series, 402-405 
Leibniz, G. W., 180-181,185 
rule, 477 
Leigh, E. R., 805 
Levinson, N., 384 
Liapunov, A. M., 541, 551 
direct method, 541-546 
function, 543-545 
Libby, Willard, 25-26 
Lienard, Alfred, 569-570 
equation, 569 
theorem, 569-570, 576-580 
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Lindemann, F., 385 
Linear algebra, 117 
Linear combination, 110, 493 
Linear damping, 558 
Linear differential equations, 
95-98,107 
second order, 107 
Linear equations, 81-82 
Linearity, 450,457 
Linearization, 510, 548 
Linearly dependent, 113-114, 494 
Linearly independent, 113-114,494 
Linear spring, 557 
Linear systems 

general theory, 492 
homogeneous, 491,498-504 
linearly dependent, 494 
linearly independent, 494 
nonhomogeneous, 491 
stability for, 529-539 
Linear transformation, 448 
Linearization, 548 
method of, 510 
Liouville, Joseph, 384-386 
Liouville's theorem, 385 
Lipschitz, R., 633 
Lipschitz condition, 633, 640 
Lobachevsky, N., 269 
Local discretization error, 651 
Local truncation error, 653 
Locke, John, 183 
Logarithmic decrement, 144 
Lorentz, G. G., 275 
Lotka, A. J., 508 

M 

Major cases for critical points, 530 
Manuel, Frank E., 186 
Mass action, law of, 29 
Mathews, G. B., 440, 444 
Maxwell, James Clerk, 179, 267, 478 
Mead, D. G., 5, 385 
Mean convergence, 337 
Mean curvature, 617 
Mean distance, 153 
Mean square error, 337 
Mechanical problem, Abel's, 471 
Mechanistic determinism, 490 


Membrane, 435 

vibrating, Euler's theory of, 407, 
435-440 

Method of Frobenius, 224 
Method of linearization, 510 
Method of separation of variables, 358, 
368,372, 374,430,437 
Method of successive approximations, 
Picard's, 623 
Metric space, 333 
Millikan, Robert A., 161 
Minimal surfaces, Euler's 
problem of, 616 

Minimax property of Chebyshev 
polynomials, 274-275 
Minkowski, Hermann, 333-334 
inequality, 332 
Mixing, 24-26 
Morehead, J., 266 
Motion 

equation of, for undamped 
pendulum, 34-37, 560 
Newton's second law of, 1, 137, 

148, 609 

Motion of particle determination 
circular path, 31 
free fall, 32-33 
vertical path, 31 
Multiplier, Lagrange, 596-599 
Multiterm Taylor methods, 655-656 

N 

Natural frequency, 141 
Natural logarithms, 670-672 
77-body problem, 488 
Negative definite function, 542 
Negative semidefinite function, 542-543 
Newton, Isaac, 45,154,179-186,483 
law of cooling, 30 
law of gravitation, 146-154 
second law of motion, 1,103,137,148, 
357,435, 609 

Nodes, 363, 365,520-522, 531-532, 
534-536 

critical point, 520-521 
Nonexact equations, 74-78 
Nonhomogeneous equation, 109,127 
Nonhomogeneous linear systems, 491 
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Nonlinear mechanics, 557-562 
Nonlinear spring 
hard, 563 
soft, 563 

Nonlinear systems, 507-512, 547-555 
Normal distribution curve 
differential equation, 62-64 
examples, 61-62 
frequency density, 53 
histogram, 52-53 
improper integrals, 54-57 
mass density function, 53 
mean, 52-54, 58 

normal distribution function, 60 
normal probability density 
function, 57 
points of inflection, 58 
probability density function, 53 
standard deviation, 54, 58 
standard normal distribution, 60 
standard normal probability 
density, 59 

Normal form, differential equation, 191 
Normal frequencies, 161 
Normalized functions, 326, 379 
Normal probability density function, 57 
Norm of a function, 331 
Norm of a vector, 329 
Null function, 331 
Numbers, Bernoulli, 322 
Numerical methods 

benchmark problem, 645-646 
computational procedures, 645 
difference equation, 643-644 
discrete sequence of points, 645 
Euler method 
error, 650-652 

exact and numerical solutions, 
648-649 

geometric realization, 648 
integral of differential 
equation, 646 

piecewise-linear curve, 647-648 
system of equations, 662-663 
total relative error, 647 
zeroth order Taylor 
polynomial, 647 
Heun, 653 

higher order methods, 657-660 


improved Euler method, 652-657 
linear second order equations, 643 
multiterm Taylor, 655-656 
power series solutions, 643 
predictor-corrector, 653 
real-number line 

approximation, 644 
Runge-Kutta, 657-660, 663-664 
single-step, 645 

O 

Ohm, G. S., 96 
Ohm's law, 96 

One-dimensional heat equation, 368 
One-dimensional wave equation, 

351, 357 

One-parameter family of curves, 11 
Open interval, 108,195,389-390, 392 
Operator, differential, 162, 428 
inverse, 164 
Operator methods 

exponential shift rule, 168-169 
inverse operators, 163-164 
partial fractions decompositions, 
165-166 

series expansions, 166-168 
successive integrations, 164-165 
Order 

of differential equation, 2 
exponential, function of, 453 
Ordinary differential equation, 2-4 
Ordinary point, 210 
Ore, O., 486 

Orthogonal functions, 325-333 
sequence of, 325 
Orthogonality 

Bessel functions, 423-425 
Chebyshev polynomials, 273-274 
Fourier coefficients, 299 
Hermite polynomials, 256-258 
Legendre polynomials, 400-402 
Orthogonal sequence, 331 
complete, 341 

Orthogonal trajectories, 11-17 
Orthogonal vectors, 329 
Orthonormal sequence, 326 
Oscillator, harmonic, 258-261 
coupled harmonic, 155-160 
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Output function, 475-476 
Overdamped motion, 140 
Overdamped vibration, 140 

P 

Parabola, 151-152 
focal property of, 79 
Parallelogram law, 336 
Parameter(s), 11 

variation of, 133-135, 506 
Parseval des Chenes, M., 342-343 
Parseval's equation, 342 
Partial differential equation, 3; See 

also Heat equation; Laplace's 
equation; Wave equation 
Partial fractions decompositions of 
operators, 165-166 
Particular solutions 

exponential shift rule, 168-169 
Heaviside's methods, 162 
inverse operators, 163-164 
partial fractions decompositions of 
operators, 165-166 
series expansions of operators, 
166-168 

successive integrations, 164-165 
Partitions, theory of, 175 
Pascal, B., 45 
Path, 443, 515 

approaches the critical point, 543 
enters the critical point, 520 
Euler, 176 

global properties of, 564 
Pauling, Linus, 29,161 
Peano, Guiseppe, 633 
Peano's theorem, 633 
Pendulum, undamped, 34-37, 560 
Pepys, Samuel, 183 
Period, 35,138, 294, 564 
Periodic boundary conditions, 388 
Periodic function, 305 
Periodic solution, 563-570 
Periods of revolution of planets, 
153-155 

Phase 

plane, 514-517 
portrait, 517 
Philosophers, 264, 269 


Picard, Emile, 623 

method of successive 

approximations, 623-625 
theorem, 8-9, 488 

continuous function satisfying 
Lipschitz condition, 634-637 
first order linear equation, 634 
hypotheses, 628 
inequality, 629, 632-633 
Lipschitz condition, 633 
mean value theorem, 628, 632 
«th partial sum of series of 
functions, 627-628 
Peano's theorem, 633 
sequence of functions, 627 
series of constants, 630 
statement, 626 

uniform convergence, 630-631 
Piecewise continuous function, 452-454 
Piecewise smooth function, 349 
Planck, Max, 610 

Planetary motion, Bessel's studies 
of, 407 

Planets, periods of revolution, 153-154 
Plateau, ]., 617 
Plateau's problem, 617 
Poincare, Jules H., 198, 259, 513, 549, 

566,574-576 

Poincare-Bendixson theorem, 563-570 
Point 

critical, 516 

asymptotically stable, 527 
borderline cases for, 530, 534-539 
center, 523 
focus, 524 
isolated, 516 
major cases for, 530-534 
node, 520-521 
path approaches, 519 
path enters, 520 
saddle point, 522 
simple, 547-555 
spiral, 524-528 
stable, 527 
unstable, 527 
vortex, 523 
equilibrium, 527 
at infinity, 242-244 
ordinary, 210-219 
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singular, 210, 219 
irregular, 220 
regular, 219-226 

Pointwise convergence theorem, 
345-350 

Poisson, Simeon D., 377-378 
integral, 376-377 
Polya, G., 175,179, 595, 601 
Polyhedra 

Euler's formula for, 177 
regular, 177 
Polynomials 
auxiliary, 156 
Bernoulli, 323 

Chebyshev, 218, 242, 270-275 
minimax property of, 274-275 
orthogonality of, 273-274 
Hermite, 219, 250-261 

generating function of, 253 
Rodrigues' formula for, 255 
Laguerre, 245 

Legendre (see Legendre polynomials) 
Population growth, 21-22 
Populations, equilibrium of, 510 
Portrait, phase, 517 
Positive definite function, 542 
Positive semidefinite function, 542 
Potential, 427 
electrostatic, 428 
electrostatic dipole, 433-434 
gravitational, 428 
Potential energy, 608 
Potential theory, 371, 428 
Powers and roots, 676-678 
Power series, 197-204, 206-207, 450 
interval of convergence, 201 
radius of convergence, 200 
Predictor-corrector methods, 653 
Prey-predator equations, Volterra's, 
507-512 

Prime number theorem, 277 
Principle 

of conservation of energy, 32-33 
Dirichlet, 267, 283, 307 
Hamilton's, 285, 582, 608-611 
of least action, 609 
of least time, Fermat's, 42, 619 
of potential theory, 307 
of superposition, 133, 478 


Problem 

Abel's mechanical, 471 
air pressure, 30 
bacteria, 27 
bead on circle, 50 
boundary value, 109, 355, 380 
regular 384 
singular, 384 

brachistochrone, 40, 45, 47,184, 581 
brine, 24, 29, 84,101 
bugs on table, 51 
buoy, 144-145 
chain on table, 50 
chemical reaction, 29 
clepsydra, 49 
confocal conics, 49 
destroyer hunting submarine, 51 
Dirichlet, 373, 433 
for a circle, 372-379 
dog-rabbit, 92-93 
earth explodes, 155 
escape velocity, 38 
falling raindrop, 104 
football, 49 
geodesics 
on cone, 594 
on cylinder, 594, 606 
on sphere, 594 
hanging chain, 88, 90, 606 
hole drilled through earth, 39, 

88,145 

initial value, 125, 626, 638, 641, 650 
isoperimetric, 595-606 
Konigsberg bridge, 173,176 
Lambert's law of absorption, 30 
law of mass action, 29 
minimal surface 
Euler's, 616-617 
of revolution, 590-591 
mirror, 78-79 
mothball, 49 
n-body, 488 

Newton's law of cooling, 30 
one-dimensional wave, 362 
path of boat, 93-94 
Plateau's, 617 

President and Prime Minister, 51 
radioactive decay, 22-23 
radon seepage, 84 
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relativity, 104 
rocket, 103-104 
rope wound around post, 50 
rotating can of water, 50 
snowplow, 49 
Sturm-Liouville, 326 
regular, 384, 387, 392 
singular, 384, 392 
tank, 49 

tapered column, 51 
tautochrone, 473, 484 
terminal velocity, 37 
Torricelli's law, 49 
Torricelli's theorem, 48 
tractrix, 91, 95 
tunnel through earth, 145 
vibrating chain, 363-364 
Wren's theorem, 47 
Pseudosphere, 91 
Pure resonance, 145 
Pursuit curves, 88-94 
Pythagorean theorem, 336 

Q 

Quantized energy levels, 261 

R 

Radioactive decay, 22-23 

Radiocarbon, 25-26 

Radiocarbon dating technique, 25-26 

Radius of convergence, 200 

Rado, T., 617 

Radon seepage, 84 

Rainville, E. D., 281 

Rapoport, Anatol, 102 

Rate constant, 23 

Ratio test, 200 

Reaction 

first order, 22 
second order, 29 
Recursion formula, 213 
three-term, 217 
two-term, 216, 218 
Reduced equation, 109 
Reduction of order, 85-87 
Refraction, Snell's law of, 42 
Regular polyhedra, 177 


Regular singular points, 219-226, 
229-235 

Regular Sturm-Liouville problem, 384 
Relative error, total, 647 
Relativity, Einstein's special 
theory of, 104 
Resonance, 143 
frequency, 144 
phenomenon, 143-144 
pure, 145 
Response 

impulsive, 479 
indicial, 476 
Restoring force, 558 
Retarded fall, 33-34 
Riccati, J. F., 101 
equation, 101 
special, 426 

Riemann, Bernhard, 186,198, 

281-287,289, 299,304, 

307,334,354 

Riemann-Roch theorem, 283 
equation, 278-281 
identity, 281 
zeta function, 286 
Riesz, F., 342 

Riesz-Fischer theorem, 342 
Ritt, J. F., 5, 385,420 
Robbins, H., 179, 334 
Rodrigues, Olinde, 397 
Rodrigues' formula, 397 

for Hermite polynomials, 255 
for Legendre polynomials, 397 
Rogosinski, W., 304 
Round-off error, 650, 652 
Runge, Carl, 653, 657 
Runge-Kutta methods, 657-660, 
663-664 

S 

Saddle points, 522-523, 532 
Sansone, G., 545 
Sarton, George, 607 
Sawtooth functions, 453 
Schrodinger, Erwin, 259, 582 
wave equation, 259 
wave functions, 259 
Schuster, M. L., 576 
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Schwarz, H. A., 334 
inequality, 332 
Scribner, Charles, Jr., 575 
Second law, Kepler's, 149 
Second law of motion, Newton's, 1,103, 
137,148, 357,435, 609 
Second order linear equation, 

638-641 

Second order reaction, 29 
Section, conic, 151 
Seeley, R. T., 376 
Self-adjoint equations, 382, 387 
Separable equations, 6 
Separation constant, 431 
Separation theorem, Sturm, 190 
Separation of variables, method of, 17, 
358,368, 372,374,430,437 
Separatrix, 561 
Sequence 

complete, 331 
orthornormal, 326, 379 
Sequence of functions, orthogonal, 325 
Series 

Bessel, 422 
binomial, 208 
Chebyshev, 274 
convergent, 199 

expansions of operators, 166-168 
Fourier, 292, 327, 354 
cosine, 314 
sine, 314, 361 
Fourier-Bessel, 422 
Frobenius, 224 
Hermite, 258 
hypergeometric, 237 
Legendre, 402-405 
power, 199 
sum of, 199 
Taylor, 203 

Shifting formula, 460 
Shift rule, exponential, 168 
Simmons, George F., 292, 306, 322, 477 
Simple critical point, 547-557 
Simple discontinuity, 301 
Simple harmonic vibrations, 137-141 
Simpson's rule, 657-658 
Sine, Euler's infinite product for, 318 
Sine series, Fourier, 314, 361 


Single-step methods, 645 
Singular point, 210 
irregular, 220 
regular, 220 

Singular Sturm-Liouville problems, 384 
Smith, D. E., 284 
Smooth function, piecewise, 349 
Snell, Willebrord, 42 
law of refraction, 42 
Solution 

general, 9,110,113-119 
linearly dependent, 494 
linearly independent, 494 
particular, 9 
periodic, 563-572 
trivial, 110, 492 
Space, metric, 333 
Special functions, 197-198 
Special Riccati equation, 426 
Special theory of relativity, Einstein's, 
104 

Specific heat, 366 
Spherical Bessel functions, 420 
Spirals, 524-526, 532-534 
Spring 

linear, 557 
nonlinear 
hard, 563 
soft, 563 

Stable critical point, 527 
Standard form, differential equation, 

191 

Standard normal probability density, 59 

Standing waves, 365 

Stationary function, 587 

Stationary value, 587 

Steady-state, 143, 371 

Steinmetz, Charles Proteus, 146 

Step function, unit, 475 

Stephens, E., 169 

Stoker, J. J., 558 

String 

stretched, 351 
struck, 365 
vibrating, 356 
Sturm, J. C. F., 190 

comparison theorem, 194-196 
separation theorem, 187-193 
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Sturm-Liouville 
equation, 391 
expansion, 383 
problems, 379-384 
regular, 384, 392 
singular, 384, 392 
Successive approximations 
crude approximation, 622 
existence and uniqueness 
theorems, 625 
Picard's theorem, 626-637 
second order linear equation, 
638-641 

initial value problem, 621 
integral equation, 621-622 
Picard's method of, 623-625 
Successive integrations, 164-165 
Superposition, principle of, 133,478 
System 

autonomous, 513-519 
auxiliary equation of, 499, 530 
conservative dynamical, 557 
linear homogeneous, 491 
linear nonhomogeneous, 491 
uncoupled, 503 

Sz.-Nagy, Bela, 304, 326, 354, 376 
Szego, G., 601 

T 

Tangent, Lambert's continued fraction 
for, 393 
Tautochrone, 48 
problem, 473 
property, 48 

Taylor methods, multiterm, 655-656 
Taylor's formula, 203 
Taylor's series, 203-204 
Terminal velocity, 34 
Test 

comparison, 452 
ratio, 200 

Theory of partitions, 175 
Theory of relativity, Einstein's 
special, 104 

Thermal conductivity, 366 
Third law, Kepler's, 153-155,181 
Tietze, H., 263 


Titchmarsh, E. C, 287, 304, 384 
Toeplitz, O., 334 
Topology, 173, 283 
Torricelli, Evangelista, 48 
law, 49 
theorem, 48 

Total discretization error, 651 
Total relative error, 647 
Total truncation error, 658 
Tractrix, 91 
Trajectory, 515 

Transcendental functions, higher, 197 
elementary, 197 

Transcendental numbers, 385-386 
Transform, 448 

inverse Laplace, 460 
Laplace, 448 
Transformation, 448 
integral, 448 
inverse Laplace, 460 
Laplace, 448 
linear, 448 
Transient, 143 
Triangle inequality, 333 
Tricomi, F. G., 520, 549 
Trigonometric functions, 667-668 
Trivial solution, 110, 492 
Truesdell, C., 179,437 
Two-dimensional fluid motion, 

516-517 

Two-dimensional Laplace equation, 373 
Two-dimensional wave equation, 437 
Two-term recursion formulas, 216 

U 

Ulam, Stanislaw, 481 
Uncoupled system, 503 
Undamped pendulum, 560 
Undamped simple harmonic vibrations, 
137-138 

Undamped vibration, 137 
Underdamped vibration, 140 
Undetermined coefficients, 127-132 
Uniform convergence, 630 
Uniqueness theorem, 108 
Unit impulse function, 479 
Unit step function, 475 
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Universe 

Euler's attitude toward, 179 
Jeans' definition of, 490 
Unstable critical point, 527 


van der Pol, Balthasar, 570 
equation, 514, 572-573 
van der Waerden, B. L., 595 
Variable mass, 103-105 
Variables, method of separation of, 17, 
358,368,372,374,430,437 
Variation of parameters 

for linear equations, 133-135 
for linear systems, 506 
Vavilov, S. I., 184 
Vector(s) 

inner product of, 329 
norm of, 329 
orthogonal, 329 
Velocity 
escape, 38 
terminal, 34, 37 

Vibrating membrane, 421,435-440 
Vibrating string, 356 
stretched, 351 
struck, 365 
Vibration(s) 

critically damped, 140 
damped, 138-141 
definition, 136 
forced, 142-144 
free, 142 

overdamped, 140 
undamped simple harmonic 
vibrations, 137-138 
underdamped, 140 
Vicar of Bray, 484 
Voltaire, 171 


Volterra, Vito, 508 
Volterra's prey-predator equations, 
507-512 
Vortex, 523-524 

W 

Wallis, John, 172,185 
Waltershausen, W. S. von, 262 
Watson, G. N., 101, 281, 408, 416, 
420, 422 

Wave equation, 3, 428 

one-dimensional, 351, 357 
Bernoulli's solution, 361 
d'Alembert's solution, 363 
Schrodinger's, 261 
two-dimensional, 437 
Wave function, Schrodinger, 261 
Wave, standing, 365 
Weierstrass, Karl, 289, 334 
Weight function for orthogonal 
sequence, 379 
Westfall, Richard S., 186 
Whewell, William, 183 
Whittaker, E. T., 281 
Wilkes, J. Q, 658 

Wren, Sir Christopher, 47,181-182 
Wren's theorem, 47 
Wright, E. M., 175 
Wronski, Hoene, 115 
Wronskian, 115-117,134,189-190, 
493-494 

Z 

Zabusky, N. J., 643 

Zero of a function, 113 

Zeros of Bessel functions, 421-422 

Zeta function, Riemann's, 286 

Zeuner, F. E., 25 


